Compositions and Methods for Use in 
Recombinational Cloning of Nucleic Acids 



PCT/US00/05432 



BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention relates generally to recombinant DNA technology. 
More particularly, the present invention relates to compositions and methods for 
use in recombinational cloning of nucleic acid molecules. The invention relates 
specifically to nucleic acid molecules encoding one or more recombination sites 
or one or more partial recombination sites, particularly attB, attP, attL, and attR, 
and fragments, mutants, variants and derivatives thereof. The invention also 
relates to such nucleic acid molecules wherein the one or more recombination site 
nucleotide sequences is operably linked to the one or more additional physical or 
functional nucleotide sequences. The invention also relates to vectors comprising 
the nucleic acid molecules of the invention, to host cells comprising the vectors 
or nucleic acid molecules of the invention, to methods of producing polypeptides 
and RNAs encoded by the nucleic acid molecules of the invention, and to 
polypeptides encoded by these nucleic acid molecules or produced by the methods 
of the invention, which may be fusion proteins. The invention also relates to 
antibodies that bind to one or more polypeptides of the invention or epitopes 
thereof, which may be monoclonal or polyclonal antibodies. The invention also 
relates to the use of these nucleic acid molecules, vectors, polypeptides and 
antibodies in methods for recombinational cloning of nucleic acids, in vitro and 
in vivo, to provide chimeric DNA molecules that have particular characteristics 
and/or DNA segments. More particularly, the antibodies of the invention may be 
used to identify and/or purify proteins or fusion proteins encoded by the nucleic 
acid molecules or vectors of the invention, or to identify and/or purify the nucleic 
acid molecules of the invention. 
Related Art 

Site-specific recombinases. Site-specific recombinases are proteins that 
are present in many organisms (e.g., viruses and bacteria) and have been 
characterized to have both endonuclease and ligase properties. These 
recombinases (along with associated proteins in some cases) recognize specific 
sequences of bases in DNA and exchange the DNA segments flanking those 
sequences. The recombinases and associated proteins are collectively referred to 
as "recombination proteins" (see, e.g., Landy, A., Current Opinion in 
Biotechnology 3:699-707 (1993)). 

Numerous recombination systems from various organisms have been 
described. See, e.g., Hoess et al., Nucleic Acids Research 14(6):2287 (1986); 
Abremski et al., J. Biol. Chem. 261(1):391 (1986); Campbell, J. 
Bacteriol. 174(23):7495 (1992); Qian et al., J. Biol. Chem. 267(11):7794 (1992); 
Araki et al., J. Mol. Biol. 225(1):25 (1992); Maeser and Kahnmann, Mol. Gen. 
Genet. 230:170-176 (1991); Esposito et al., Nucl. Acids Res. 25(18):3605 
(1997). 

Many of these belong to the integrase family of recombinases (Argos et al., 
EMBO J. 5:433-440 (1986); Voziyanov et al., Nucl. Acids Res. 27:930 (1999)). 
Perhaps the best studied of these are the Integrase/att system from 
bacteriophage λ (Landy, A., Current Opinions in Genetics and Devel. 3:699-707 
(1993)), the Cre/loxP system from bacteriophage P1 (Hoess and Abremski (1990) 
In Nucleic Acids and Molecular Biology, vol. 4. Eds.: Eckstein and Lilley, 
Berlin-Heidelberg: Springer-Verlag; pp. 90-109), and the FLP/FRT system from the 
Saccharomyces cerevisiae 2 μ circle plasmid (Broach et al., Cell 29:227-234 
(1982)). 

Backman (U.S. Patent No. 4,673,640) discloses the in vivo use of λ 
recombinase to recombine a protein producing DNA segment by enzymatic site- 
specific recombination using wild-type recombination sites attB and attP. 

Hasan and Szybalski (Gene 56:145-151 (1987)) disclose the use of λ Int 
recombinase in vivo for intramolecular recombination between wild-type attP and 
attB sites which flank a promoter. Because the orientations of these sites are 
inverted relative to each other, this causes an irreversible flipping of the promoter 
region relative to the gene of interest. 

Palazzolo et al., Gene 88:25-36 (1990), disclose phage lambda vectors 
having bacteriophage λ arms that contain restriction sites positioned outside a 
cloned DNA sequence and between wild-type loxP sites. Infection of E. coli cells 
that express the Cre recombinase with these phage vectors results in 
recombination between the loxP sites and the in vivo excision of the plasmid 
replicon, including the cloned cDNA. 

Posfai et al. (Nucl. Acids Res. 22:2392-2398 (1994)) disclose a method 
for inserting into genomic DNA partial expression vectors having a selectable 
marker, flanked by two wild-type FRT recognition sequences. FLP site-specific 
recombinase as present in the cells is used to integrate the vectors into the genome 
at predetermined sites. Under conditions where the replicon is functional, this 
cloned genomic DNA can be amplified. 

Bebee et al. (U.S. Patent No. 5,434,066) disclose the use of site-specific 
recombinases such as Cre for DNA containing two loxP sites for in vivo 
recombination between the sites. 

Boyd (Nucl. Acids Res. 21:817-821 (1993)) discloses a method to 
facilitate the cloning of blunt-ended DNA using conditions that encourage 
intermolecular ligation to a dephosphorylated vector that contains a wild-type loxP 
site acted upon by a Cre site-specific recombinase present in E. coli host cells. 

Waterhouse et al. (WO 93/19172 and Nucleic Acids Res. 21(9):2265 
(1993)) disclose an in vivo method where light and heavy chains of a particular 
antibody were cloned in different phage vectors between loxP and loxP 511 sites 
and used to transfect new E. coli cells. Cre, acting in the host cells on the two 
parental molecules (one plasmid, one phage), produced four products in 
equilibrium: two different cointegrates (produced by recombination at either loxP 
or loxP 511 sites), and two daughter molecules, one of which was the desired 
product. 

Schlake & Bode (Biochemistry 33:12746-12751 (1994)) disclose an in 
vivo method to exchange expression cassettes at defined chromosomal locations, 
each flanked by a wild-type and a spacer-mutated FRT recombination site. A 
double-reciprocal crossover was mediated in cultured mammalian cells by using 
this FLP/FRT system for site-specific recombination. 

Hartley et al. (U.S. Patent No. 5,888,732) disclose compositions and 
methods for recombinational exchange of nucleic acid segments and molecules, 
including for use in recombinational cloning of a variety of nucleic acid molecules 
in vitro and in vivo, using a combination of wildtype and mutated recombination 
sites and recombination proteins. 

Transposases. The transposases, a family of enzymes, have also been 
used to transfer genetic information between replicons. Transposons are 
structurally variable, being described as simple or compound, but typically encode 
the recombinase gene flanked by DNA sequences organized in inverted 
orientations. Integration of transposons can be random or highly specific. 
Representatives such as Tn7, which are highly site-specific, have been applied to 
the in vivo movement of DNA segments between replicons (Lucklow et al., 
J. Virol. 67:4566-4579 (1993)). 

Devine and Boeke, Nucl. Acids Res. 22:3765-3772 (1994), disclose the 
construction of artificial transposons for the insertion of DNA segments, in vitro, 
into recipient DNA molecules. The system makes use of the integrase of yeast 
TY1 virus-like particles. The DNA segment of interest is cloned, using standard 
methods, between the ends of the transposon-like element TY1. In the presence 
of the TY1 integrase, the resulting element integrates randomly into a second 
target DNA molecule. 

Recombination Sites. Also key to the integration/recombination reactions 
mediated by the above-noted recombination proteins and/or transposases are 
recognition sequences, often termed "recombination sites," on the DNA molecules 
participating in the integration/recombination reactions. These recombination sites 
are discrete sections or segments of DNA on the participating nucleic acid 
molecules that are recognized and bound by the recombination proteins during the 
initial stages of integration or recombination. For example, the recombination site 
for Cre recombinase is loxP which is a 34 base pair sequence comprised of two 
13 base pair inverted repeats (serving as the recombinase binding sites) flanking 
an 8 base pair core sequence. See Figure 1 of Sauer, B., Curr. Opin. Biotech. 
5:521-527 (1994). Other examples of recognition sequences include the attB, 
attP, attL, and attR sequences which are recognized by the recombination protein 
λ Int. attB is an approximately 25 base pair sequence containing two 9 base pair 
core-type Int binding sites and a 7 base pair overlap region, while attP is an 
approximately 240 base pair sequence containing core-type Int binding sites and 
arm-type Int binding sites as well as sites for auxiliary proteins integration host 
factor (IHF), FIS and excisionase (Xis). See Landy, Curr. Opin. Biotech. 
3:699-707 (1993); see also U.S. Patent No. 5,888,732, which is incorporated by 
reference herein. 
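
The loxP layout described above (two 13 base pair inverted repeats flanking an 8 base pair core) can be checked with a short sketch. The sequence used here is the commonly cited wild-type loxP sequence and is an assumption for illustration; it is not given in the text above. The helper names are likewise hypothetical.

```python
# Commonly cited wild-type loxP sequence (assumption; not given in the text above).
LOXP = "ATAACTTCGTATAGCATACATTATACGAAGTTAT"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def split_site(site: str, arm_len: int = 13, core_len: int = 8):
    """Split a site into (left arm, core, right arm) using fixed lengths."""
    if len(site) != 2 * arm_len + core_len:
        raise ValueError("site length does not match the arm/core lengths")
    left = site[:arm_len]
    core = site[arm_len:arm_len + core_len]
    right = site[arm_len + core_len:]
    return left, core, right


def has_inverted_repeats(site: str) -> bool:
    """True if the right arm is the reverse complement of the left arm."""
    left, _core, right = split_site(site)
    return right == reverse_complement(left)


if __name__ == "__main__":
    print(len(LOXP), split_site(LOXP), has_inverted_repeats(LOXP))
```

In this sequence the 8 base pair core is not its own reverse complement, which is what gives the site a direction and makes the relative orientation of two sites meaningful.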

DNA cloning. The cloning of DNA segments currently occurs as a daily 
routine in many research labs and as a prerequisite step in many genetic analyses. 
The purposes of these clonings vary; however, two general purposes can be 
considered: (1) the initial cloning of DNA from large DNA or RNA segments 
(chromosomes, YACs, PCR fragments, mRNA, etc.), done in a relative handful 
of known vectors such as pUC, pGem, pBlueScript, and (2) the subcloning of 
these DNA segments into specialized vectors for functional analysis. A great deal 
of time and effort is expended in the transfer of DNA segments from the 
initial cloning vectors to the more specialized vectors. This transfer is called 
subcloning. 

The basic methods for cloning have been known for many years and have 
changed little during that time. A typical cloning protocol is as follows: 

(1) digest the DNA of interest with one or two restriction enzymes; 

(2) gel purify the DNA segment of interest when known; 

(3) prepare the vector by cutting with appropriate restriction 
enzymes, treating with alkaline phosphatase, gel purifying, etc., as 
appropriate; 

(4) ligate the DNA segment to the vector, with appropriate 
controls to eliminate background of uncut and self-ligated vector; 

(5) introduce the resulting vector into an E. coli host cell; 

(6) pick selected colonies and grow small cultures overnight; 

(7) make DNA minipreps; and 
(8) analyze the isolated plasmid on agarose gels (often after 
diagnostic restriction enzyme digestions) or by PCR. 

The specialized vectors used for subcloning DNA segments are 
functionally diverse. These include but are not limited to: vectors for expressing 
nucleic acid molecules in various organisms; for regulating nucleic acid molecule 
expression; for providing tags to aid in protein purification or to allow tracking of 
proteins in cells; for modifying the cloned DNA segment (e.g., generating 
deletions); for the synthesis of probes (e.g., riboprobes); for the preparation of 
templates for DNA sequencing; for the identification of protein coding regions; for 
the fusion of various protein-coding regions; to provide large amounts of the DNA 
of interest, etc. It is common that a particular investigation will involve subcloning 
the DNA segment of interest into several different specialized vectors. 

As known in the art, simple subclonings can be done in one day (e.g., the 
DNA segment is not large and the restriction sites are compatible with those of the 
subcloning vector). However, many other subclonings can take several weeks, 
especially those involving unknown sequences, long fragments, toxic genes, 
unsuitable placement of restriction sites, high backgrounds, impure enzymes, etc. 
Subcloning DNA fragments is thus often viewed as a chore to be done as few 
times as possible. 

Several methods for facilitating the cloning of DNA segments have been 
described, e.g., as in the following references. 

Ferguson, J., et al., Gene 16:191 (1981), disclose a family of vectors for 
subcloning fragments of yeast DNA. The vectors encode kanamycin resistance. 
Clones of longer yeast DNA segments can be partially digested and ligated into 
the subcloning vectors. If the original cloning vector conveys resistance to 
ampicillin, no purification is necessary prior to transformation, since the selection 
will be for kanamycin. 

Hashimoto-Gotoh, T., et al., Gene 41:125 (1986), disclose a subcloning 
vector with unique cloning sites within a streptomycin sensitivity gene; in a 
streptomycin-resistant host, only plasmids with inserts or deletions in the dominant 
sensitivity gene will survive streptomycin selection. 
Accordingly, traditional subcloning methods, using restriction enzymes and 
ligase, are time consuming and relatively unreliable. Considerable labor is 
expended, and if two or more days later the desired subclone cannot be found 
among the candidate plasmids, the entire process must then be repeated with 
alternative conditions attempted. Although site specific recombinases have been 
used to recombine DNA in vivo, the successful use of such enzymes in vitro was 
expected to suffer from several problems. For example, the site specificities and 
efficiencies were expected to differ in vitro; topologically linked products were 
expected; and the topology of the DNA substrates and recombination proteins was 
expected to differ significantly in vitro (see, e.g., Adams et al., J. Mol. 
Biol. 226:661-73 (1992)). Reactions that could go on for many hours in vivo 
were expected to occur in significantly less time in vitro before the enzymes 
became inactive. In addition, the stabilities of the recombination enzymes after 
incubation for extended periods of time in in vitro reactions were unknown, as 
were the effects of the topologies (i.e., linear, coiled, supercoiled, etc.) of the 
nucleic acid molecules involved in the reaction. Multiple DNA recombination 
products were expected in the biological host used, resulting in unsatisfactory 
reliability, specificity or efficiency of subcloning. Thus, in vitro recombination 
reactions were not expected to be sufficiently efficient to yield the desired levels 
of product. 

Accordingly, there is a long felt need to provide an alternative subcloning 
system that provides advantages over the known use of restriction enzymes 
and ligases. 

SUMMARY OF THE INVENTION 

The present invention relates to nucleic acid molecules encoding one or 
more recombination sites or one or more partial recombination sites, particularly 
attB, attP, attL, and attR, and fragments, mutants, variants and derivatives 
thereof. The invention also relates to such nucleic acid molecules comprising one 
or more of the recombination site nucleotide sequences or portions thereof and 
one or more additional physical or functional nucleotide sequences, such as those 
encoding one or more multiple cloning sites, one or more transcription termination 
sites, one or more transcriptional regulatory sequences (e.g., one or more 
promoters, enhancers, or repressors), one or more translational signal sequences, 
one or more nucleotide sequences encoding a fusion partner protein or peptide 
(e.g., GST, His6 or thioredoxin), one or more selection markers or modules, one 
or more nucleotide sequences encoding localization signals such as nuclear 
localization signals or secretion signals, one or more origins of replication, one or 
more protease cleavage sites, one or more desired proteins or peptides encoded 
by a gene or a portion of a gene, and one or more 5' or 3' polynucleotide tails 
(particularly a poly-G tail). The invention also relates to such nucleic acid 
molecules wherein the one or more recombination site nucleotide sequences is 
operably linked to the one or more additional physical or functional nucleotide 
sequences. 

The invention also relates to primer nucleic acid molecules comprising the 
recombination site nucleotide sequences of the invention (or portions thereof), and 
to such primer nucleic acid molecules linked to one or more target-specific (e.g., 
one or more gene-specific) primer nucleic acid sequences. Such primers may also 
comprise sequences complementary or homologous to DNA or RNA sequences 
to be amplified, e.g., by PCR, RT-PCR, etc. Such primers may also comprise 
sequences or portions of sequences useful in the expression of protein genes 
(ribosome binding sites, localization signals, protease cleavage sites, repressor 
binding sites, promoters, transcription stops, stop codons, etc.). Said primers may 
also comprise sequences or portions of sequences useful in the manipulation of 
DNA molecules (restriction sites, transposition sites, sequencing primers, etc.). 
The primers of the invention may be used in nucleic acid synthesis and preferably 
are used for amplification (e.g., PCR) of nucleic acid molecules. When the 
primers of the invention include target- or gene-specific sequences (any sequence 
contained within the target to be synthesized or amplified including translation 
signals, gene sequences, stop codons, transcriptional signals (e.g., promoters) and 
the like), amplification or synthesis of target sequences or genes may be 
accomplished. Thus, the invention relates to synthesis of a nucleic acid molecule 
comprising mixing one or more primers of the invention with a nucleic acid 
template, and incubating said mixture under conditions sufficient to make a first 
nucleic acid molecule complementary to all or a portion of said template. Thus, 
the invention relates specifically to a method of synthesizing a nucleic acid 
molecule comprising: 

(a) mixing a nucleic acid template with a polypeptide having polymerase 
activity and one or more primers comprising one or more 
recombination sites or portions thereof; and 

(b) incubating said mixture under conditions sufficient to synthesize a first 
nucleic acid molecule complementary to all or a portion of said 
template and which preferably comprises one or more recombination 
sites or portions thereof. 

Such method of the invention may further comprise incubating said first 
synthesized nucleic acid molecule under conditions sufficient to synthesize a 
second nucleic acid molecule complementary to all or a portion of said first nucleic 
acid molecule. Such synthesis may provide for a first nucleic acid molecule having 
a recombination site or portion thereof at one or both of its termini. 

In a preferred aspect, for the synthesis of the nucleic acid molecules, at 
least two primers are used wherein each primer comprises a homologous sequence 
at its terminus and/or within internal sequences of each primer (which may have 
a homology length of about 2 to about 500 bases, preferably about 3 to about 100 
bases, about 4 to about 50 bases, about 5 to about 25 bases and most preferably 
about 6 to about 18 base overlap). In a preferred aspect, the first such primer 
comprises at least one target-specific sequence and at least one recombination site 
or portion thereof while the second primer comprises at least one recombination 
site or portion thereof. Preferably, the homologous regions between the first and 
second primers comprise at least a portion of the recombination site. In another 
aspect, the homologous regions between the first and second primers may 
comprise one or more additional sequences, e.g., expression signals, translational 
start motifs, or other sequences adding functionality to the desired nucleic acid 
sequence upon amplification. In practice, two pairs of primers prime synthesis or 
amplification of a nucleic acid molecule. In a preferred aspect, all or at least a 
portion of the synthesized or amplified nucleic acid molecule will be homologous 
to all or a portion of the template and further comprises a recombination site or 
a portion thereof at at least one terminus and preferably both termini of the 
synthesized or amplified molecule. Such synthesized or amplified nucleic acid 
molecule may be double stranded or single stranded and may be used in the 
recombinational cloning methods of the invention. The homologous primers of 
the invention provide a substantial advantage in that one set of the primers may 
be standardized for any synthesis or amplification reaction. That is, the primers 
providing the recombination site sequences (without the target specific sequences) 
can be pre-made and readily available for use. This in practice allows the use of 
shorter custom made primers that contain the target specific sequence needed to 
synthesize or amplify the desired nucleic acid molecule. Thus, this provides 
reduced time and cost in preparing target specific primers (e.g., shorter primers 
containing the target specific sequences can be prepared and used in synthesis 
reactions). The standardized primers, on the other hand, may be produced in mass 
to reduce cost and can be readily provided (e.g., in kits or as a product) to 
facilitate synthesis of the desired nucleic acid molecules. 
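
The division of labor between a pre-made standardized primer and a shorter custom primer can be sketched as follows. This is a minimal illustration, not the method of the invention: all three sequences are hypothetical placeholders chosen for the example, and the function names are assumptions. The sketch builds a custom primer from a partial site plus a target-specific sequence, then measures the overlap it shares with the standardized primer and checks it against the "about 6 to about 18 base" range stated above.

```python
# Hypothetical sequences for illustration only (assumptions, not from the text).
STANDARD_PRIMER = "GGGGACAAGTTTGTACAAAAAAGCAGGCT"  # carries a full site
PARTIAL_SITE_TAIL = "AAAAAGCAGGCT"                  # portion of that site
TARGET_SPECIFIC = "ATGGCTAGCAAAGGAGAAG"             # anneals to the template


def build_custom_primer(partial_site: str, target_specific: str) -> str:
    """Custom primer: partial recombination site followed by target sequence."""
    return partial_site + target_specific


def shared_overlap(standard: str, custom: str) -> int:
    """Length of the longest suffix of `standard` that is a prefix of `custom`."""
    limit = min(len(standard), len(custom))
    for n in range(limit, 0, -1):
        if standard[-n:] == custom[:n]:
            return n
    return 0


def overlap_in_preferred_range(n: int, low: int = 6, high: int = 18) -> bool:
    """Check an overlap length against the most preferred range in the text."""
    return low <= n <= high


if __name__ == "__main__":
    custom = build_custom_primer(PARTIAL_SITE_TAIL, TARGET_SPECIFIC)
    n = shared_overlap(STANDARD_PRIMER, custom)
    print(len(custom), n, overlap_in_preferred_range(n))
```

Only the custom primer changes from target to target in this sketch; the standardized primer stays fixed, which is the cost advantage the paragraph above describes.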

Thus, in one preferred aspect, the invention relates to a method of 
synthesizing or amplifying one or more nucleic acid molecules comprising: 

(a) mixing one or more nucleic acid templates with at least one 
polypeptide having polymerase or reverse transcriptase activity 
and at least a first primer comprising a template specific sequence 
(complementary to or capable of hybridizing to said templates) and 
at least a second primer comprising all or a portion of a 
recombination site wherein said at least a portion of said second 
primer is homologous to or complementary to at least a portion of 
said first primer; and 

(b) incubating said mixture under conditions sufficient to synthesize or 
amplify one or more nucleic acid molecules complementary to all 
or a portion of said templates and comprising one or more 
recombination sites or portions thereof at one and preferably both 
termini of said molecules. 



More specifically, the invention relates to a method of synthesizing or 
amplifying one or more nucleic acid molecules comprising: 

(a) mixing one or more nucleic acid templates with at least one 
polypeptide having polymerase or reverse transcriptase activity 
and at least a first primer comprising a template specific sequence 
(complementary to or capable of hybridizing to said templates) and 
at least a portion of a recombination site, and at least a second 
primer comprising all or a portion of a recombination site wherein 
said at least a portion of said recombination site on said second 
primer is complementary to or homologous to at least a portion of 
said recombination site on said first primer; and 

(b) incubating said mixture under conditions sufficient to synthesize or 
amplify one or more nucleic acid molecules complementary to all 
or a portion of said templates and comprising one or more 
recombination sites or portions thereof at one and preferably both 
termini of said molecules. 

In a more preferred aspect, the invention relates to a method of amplifying 
or synthesizing one or more nucleic acid molecules comprising: 

(a) mixing one or more nucleic acid templates with at least one 
polypeptide having polymerase or reverse transcriptase activity 
and one or more first primers comprising at least a portion of a 
recombination site and a template specific sequence 
(complementary to or capable of hybridizing to said template); 

(b) incubating said mixture under conditions sufficient to synthesize or 
amplify one or more first nucleic acid molecules complementary to 
all or a portion of said templates wherein said molecules comprise 
at least a portion of a recombination site at one and preferably 
both termini of said molecules; 

(c) mixing said molecules with one or more second primers 
comprising one or more recombination sites, wherein said 
recombination sites of said second primers are homologous to or 
complementary to at least a portion of said recombination sites on 
said first nucleic acid molecules; and 

(d) incubating said mixture under conditions sufficient to synthesize or 
amplify one or more second nucleic acid molecules complementary 
to all or a portion of said first nucleic acid molecules and which 
comprise one or more recombination sites at one and preferably 
both termini of said molecules. 
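
Steps (a) through (d) can be pictured with a simplified string model. The sketch tracks only one strand of the product and ignores strand complementarity, polymerase behavior, and reaction conditions, so it illustrates how partial sites are completed rather than simulating the reaction. All sequences and function names are hypothetical.

```python
def extend_with_overlap(upstream: str, downstream: str, min_overlap: int = 6) -> str:
    """Join two same-strand sequences that share a suffix/prefix overlap."""
    limit = min(len(upstream), len(downstream))
    for n in range(limit, min_overlap - 1, -1):
        if upstream[-n:] == downstream[:n]:
            return upstream + downstream[n:]
    raise ValueError("no overlap of the required length")


def two_round_product(target, left_partial, right_partial, left_full, right_full):
    """Return the (first, second) products of the two rounds."""
    # Steps (a)-(b): first primers add partial sites to both termini.
    first = left_partial + target + right_partial
    # Steps (c)-(d): second primers overlap the partial sites and complete them.
    second = extend_with_overlap(left_full, first)
    second = extend_with_overlap(second, right_full)
    return first, second


# Hypothetical placeholder sequences (assumptions, not from the text).
TARGET = "ATGGCTAGCAAAGGAGAAG"
LEFT_FULL = "GGGGACAAGTTTGTACAAAAAAGCAGGCT"
RIGHT_FULL = "ACCCAGCTTTCTTGTACAAAGTGGTCCCC"
LEFT_PARTIAL = LEFT_FULL[-12:]   # portion shared with the full left site
RIGHT_PARTIAL = RIGHT_FULL[:12]  # portion shared with the full right site

if __name__ == "__main__":
    first, second = two_round_product(
        TARGET, LEFT_PARTIAL, RIGHT_PARTIAL, LEFT_FULL, RIGHT_FULL
    )
    print(len(first), len(second))
```

In this model the second product carries a complete site at each terminus, while the first product carries only the portions supplied by the first primers.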

The invention also relates to vectors comprising the nucleic acid molecules 
of the invention, to host cells comprising the vectors or nucleic acid molecules of 
the invention, to methods of producing polypeptides encoded by the nucleic acid 
molecules of the invention, and to polypeptides encoded by these nucleic acid 
molecules or produced by the methods of the invention, which may be fusion 
proteins. The invention also relates to antibodies that bind to one or more 
polypeptides of the invention or epitopes thereof, which may be monoclonal or 
polyclonal antibodies. The invention also relates to the use of these nucleic acid 
molecules, primers, vectors, polypeptides and antibodies in methods for 
recombinational cloning of nucleic acids, in vitro and in vivo, to provide chimeric 
DNA molecules that have particular characteristics and/or DNA segments. 

The antibodies of the invention may have particular use to identify and/or 
purify peptides or proteins (including fusion proteins produced by the invention), 
and to identify and/or purify the nucleic acid molecules of the invention or 
portions thereof. 

The methods for in vitro or in vivo recombinational cloning of nucleic acid 
molecules generally relate to recombination between at least a first nucleic acid 
molecule having at least one recombination site and a second nucleic acid 
molecule having at least one recombination site to provide a chimeric nucleic acid 
molecule. In one aspect, the methods relate to recombination between a first 
vector having at least one recombination site and a second vector having at least 
one recombination site to provide a chimeric vector. In another aspect, a nucleic 
acid molecule having at least one recombination site is combined with a vector 
having at least one recombination site to provide a chimeric vector. In a most 
preferred aspect, the nucleic acid molecules or vectors used in recombination 
comprise two or more recombination sites. In a more specific embodiment of the 
invention, the recombination methods relate to a Destination Reaction (also 
referred to herein as an "LR reaction") in which recombination occurs between an 
Entry clone and a Destination Vector. Such a reaction transfers the nucleic acid 
molecule of interest from the Entry Clone into the Destination Vector to create an 
Expression Clone. The methods of the invention also specifically relate to an 
Entry or Gateward reaction (also referred to herein as a "BP reaction") in which 
an Expression Clone is recombined with a Donor vector to produce an Entry 
clone. In other aspects, the invention relates to methods to prepare Entry clones 
by combining an Entry vector with at least one nucleic acid molecule (e.g., gene 
or portion of a gene). The invention also relates to conversion of a desired vector 
into a Destination Vector by including one or more (preferably at least two) 
recombination sites in the vector of interest. In a more preferred aspect, a nucleic 
acid molecule (e.g., a cassette) having at least two recombination sites flanking a 
selectable marker (e.g., a toxic gene or a genetic element preventing the survival 
of a host cell containing that gene or element, and/or preventing replication, 
partition or heritability of a nucleic acid molecule (e.g., a vector or plasmid) 
comprising that gene or element) is added to the vector to make a Destination 
Vector of the invention. 
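
The flow of a segment between Entry Clone, Destination Vector, Expression Clone and Donor vector can be sketched as a small data model. The pairing of site types used here (attL with attR for the LR reaction, attB with attP for the BP reaction, each yielding the other pair) is an assumption inferred from the reaction names, and the class, field and example names are hypothetical; the passage above does not spell out which sites each molecule carries.

```python
from dataclasses import dataclass
from typing import Tuple

# Assumed site conversions, inferred from the names "LR" and "BP".
PRODUCT_SITES = {
    ("attL", "attR"): ("attB", "attP"),
    ("attB", "attP"): ("attL", "attR"),
}


@dataclass(frozen=True)
class Molecule:
    site: str      # type of the recombination sites flanking the insert
    insert: str    # segment held between the sites
    backbone: str  # everything outside the sites


def recombine(a: Molecule, b: Molecule) -> Tuple[Molecule, Molecule]:
    """Exchange the inserts of two molecules and convert their site types."""
    key = (a.site, b.site)
    if key not in PRODUCT_SITES:
        raise ValueError(f"no reaction defined for {key}")
    site_a, site_b = PRODUCT_SITES[key]
    # a's insert moves into b's backbone; b's insert moves into a's backbone.
    return (
        Molecule(site_a, a.insert, b.backbone),
        Molecule(site_b, b.insert, a.backbone),
    )


if __name__ == "__main__":
    entry_clone = Molecule("attL", "gene of interest", "entry backbone")
    destination = Molecule("attR", "selectable marker", "expression backbone")
    expression_clone, by_product = recombine(entry_clone, destination)  # LR
    donor = Molecule("attP", "selectable marker", "donor backbone")
    new_entry, _ = recombine(expression_clone, donor)                   # BP
    print(expression_clone, by_product, new_entry, sep="\n")
```

The model shows why a selectable marker between the sites of the Destination Vector is useful: after the exchange, the marker sits on the by-product rather than on the Expression Clone.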

Preferred vectors for use in the invention include prokaryotic vectors, 
eukaryotic vectors, or vectors which may shuttle between various prokaryotic 
and/or eukaryotic systems (e.g., shuttle vectors). Preferred prokaryotic vectors for 
use in the invention include but are not limited to vectors which may propagate 
and/or replicate in gram negative and/or gram positive bacteria, including bacteria 
of the genera Escherichia, Salmonella, Proteus, Clostridium, Klebsiella, Bacillus, 
Streptomyces, and Pseudomonas and preferably in the species E. coli. Eukaryotic 
vectors for use in the invention include vectors which propagate and/or replicate 
in yeast cells, plant cells, mammalian cells (particularly human and mouse), 
fungal cells, insect cells, nematode cells, fish cells and the like. Particular vectors 
of interest include but are not limited to cloning vectors, sequencing vectors, 
expression vectors, fusion vectors, two-hybrid vectors, gene therapy vectors, 
phage display vectors, gene-targeting vectors, PACs, BACs, YACs, MACs, and 
reverse two-hybrid vectors. Such vectors may be used in prokaryotic and/or 
eukaryotic systems depending on the particular vector. 

In another aspect, the invention relates to kits which may be used in 
carrying out the methods of the invention, and more specifically relates to cloning 
or subcloning kits and kits for carrying out the LR Reaction (e.g., making an 
Expression Clone), for carrying out the BP Reaction (e.g., making an Entry 
Clone), and for making Entry Clone and Destination Vector molecules of the 
invention. Such kits may comprise a carrier or receptacle being compartmentalized 
to receive and hold therein any number of containers. Such containers may 
contain any number of components for carrying out the methods of the invention 
or combinations of such components. In particular, a kit of the invention may 
comprise one or more components (or combinations thereof) selected from the 
group consisting of one or more recombination proteins or auxiliary factors or 
combinations thereof, one or more compositions comprising one or more 
recombination proteins or auxiliary factors or combinations thereof (for example, 
GATEWAY™ LR Clonase™ Enzyme Mix or GATEWAY™ BP Clonase™ 
Enzyme Mix), one or more reaction buffers, one or more nucleotides, one or more 
primers of the invention, one or more restriction enzymes, one or more ligases, 
one or more polypeptides having polymerase activity (e.g., one or more reverse 
transcriptases or DNA polymerases), one or more proteinases (e.g., proteinase K 
or other proteinases), one or more Destination Vector molecules, one or more 
Entry Clone molecules, one or more host cells (e.g., competent cells, such as E. 
coli cells, yeast cells, animal cells (including mammalian cells, insect cells, 
nematode cells, avian cells, fish cells, etc.), plant cells, and most particularly 
E. coli DB3.1 host cells, such as E. coli LIBRARY EFFICIENCY® DB3.1™ 
Competent Cells), instructions for using the kits of the invention (e.g., to carry out 
the methods of the invention), and the like. In related aspects, the kits of the 
invention may comprise one or more nucleic acid molecules encoding one or more 
recombination sites or portions thereof, particularly one or more nucleic acid 
molecules comprising a nucleotide sequence encoding the one or more 
recombination sites or portions thereof of the invention. Preferably, such nucleic 
acid molecules comprise at least two recombination sites which flank a selectable 
marker (e.g., a toxic gene and/or antibiotic resistance gene). In a preferred aspect, 
such nucleic acid molecules are in the form of a cassette (e.g., a linear nucleic acid 
molecule comprising one or more and preferably two or more recombination sites 
or portions thereof). 

Kits for inserting or adding recombination sites to nucleic acid molecules 
of interest may comprise one or more nucleases (preferably restriction 
endonucleases), one or more ligases, one or more topoisomerases, one or more 
polymerases, and one or more nucleic acid molecules or adapters comprising one 
or more recombination sites. Kits for integrating recombination sites into one or 
more nucleic acid molecules of interest may comprise one or more components (or 
combinations thereof) selected from the group consisting of one or more 
integration sequences comprising one or more recombination sites. Such 
integration sequences may comprise one or more transposons, integrating viruses, 
homologous recombination sequences, RNA molecules, one or more host cells 
and the like. 

Kits for making the Entry Clone molecules of the invention may comprise 
any number of components and the composition of such kits may vary 
depending on the specific method involved. Such methods may involve inserting 
the nucleic acid molecules of interest into an Entry or Donor Vector by the 
recombinational cloning methods of the invention, or using conventional molecular 
biology techniques (e.g., restriction enzyme digestion and ligation). In a preferred 
aspect, the Entry Clone is made using nucleic acid amplification or synthesis 
products. Kits for synthesizing Entry Clone molecules from amplification or 
synthesis products may comprise one or more components (or combinations 
thereof) selected from the group consisting of one or more Donor Vectors (e.g., 
one or more attP vectors including, but not limited to, pDONR201 (Figure 49), 
pDONR202 (Figure 50), pDONR203 (Figure 51), pDONR204 (Figure 52), 
pDONR205 (Figure 53), pDONR206 (Figure 54), and the like), one or more 
polypeptides having polymerase activity (preferably DNA polymerases and most 
preferably thermostable DNA polymerases), one or more proteinases, one or more 
reaction buffers, one or more nucleotides, one or more primers comprising one or 
more recombination sites or portions thereof, and instructions for making one or 
more Entry Clones. 

Kits for making the Destination vectors of the invention may comprise any 
number of components and the compositions of such kits may vary depending on 
the specific method involved. Such methods may include the recombination 
methods of the invention or conventional molecular biology techniques (e.g., 
restriction endonuclease digestion and ligation). In a preferred aspect, the 
Destination vector is made by inserting a nucleic acid molecule comprising at least 
one recombination site (or portion thereof) of the invention (preferably a nucleic 
acid molecule comprising at least two recombination sites or portions thereof 
flanking a selectable marker) into a desired vector to convert the desired vector 
into a Destination vector of the invention. Such kits may comprise at least one 
component (or combinations thereof) selected from the group consisting of one 
or more restriction endonucleases, one or more ligases, one or more polymerases, 
one or more nucleotides, reaction buffers, one or more nucleic acid molecules 
comprising at least one recombination site or portion thereof (preferably at least 
one nucleic acid molecule comprising at least two recombination sites flanking at 
least one selectable marker, such as a cassette comprising at least one selectable 
marker such as antibiotic resistance genes and/or toxic genes), and instructions for 
making such Destination vectors. 

The invention also relates to kits for using the antibodies of the invention 
in identification and/or isolation of peptides and proteins (which may be fusion 
proteins) produced by the nucleic acid molecules of the invention, and for 
identification and/or isolation of the nucleic acid molecules of the invention or 
portions thereof. Such kits may comprise one or more components (or 
combinations thereof) selected from the group consisting of one or more antibodies 
of the invention, one or more detectable labels, one or more solid supports and the 
like. 

Other preferred embodiments of the present invention will be apparent to 
one of ordinary skill in light of what is known in the art, in light of the following 
drawings and description of the invention, and in light of the claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 depicts one general method of the present invention, wherein the 
starting (parent) DNA molecules can be circular or linear. The goal is to exchange 
the new subcloning vector D for the original cloning vector B. It is desirable in 
one embodiment to select for AD and against all the other molecules, including the 
Cointegrate. The square and circle are sites of recombination: e.g., lox (such as 
loxP) sites, att sites, etc. For example, segment D can contain expression signals, 
protein fusion domains, new drug markers, new origins of replication, or 
specialized functions for mapping or sequencing DNA. It should be noted that the 
cointegrate molecule contains Segment D (Destination vector) adjacent to 
segment A (Insert), thereby juxtaposing functional elements in D with the insert 
in A. Such molecules can be used directly in vitro (e.g., if a promoter is positioned 
adjacent to a gene, for in vitro transcription/translation) or in vivo (following 
isolation in a cell capable of propagating ccdB-containing vectors) by selecting for 
the selection markers in Segments B+D. As one skilled in the art will recognize, 
this single step method has utility in certain envisioned applications of the 
invention. 

Figure 2 is a more detailed depiction of the recombinational cloning 
system of the invention, referred to herein as the "GATEWAY™ Cloning 
System." This figure depicts the production of Expression Clones via a 
"Destination Reaction," which may also be referred to herein as an "LR Reaction." 
A kanʳ vector (referred to herein as an "Entry clone") containing a DNA molecule 
of interest (e.g., a gene) localized between an attL1 site and an attL2 site is 
reacted with an ampʳ vector (referred to herein as a "Destination Vector") 
containing a toxic or "death" gene localized between an attR1 site and an attR2 
site, in the presence of GATEWAY™ LR Clonase™ Enzyme Mix (a mixture of 
Int, IHF and Xis). After incubation at 25 °C for about 60 minutes, the reaction 
yields an ampʳ Expression Clone containing the DNA molecule of interest 
localized between an attB1 site and an attB2 site, and a kanʳ byproduct molecule, 
as well as intermediates. The reaction mixture may then be transformed into host 
cells (e.g., E. coli) and clones containing the nucleic acid molecule of interest may 
be selected by plating the cells onto ampicillin-containing media and picking ampʳ 
colonies. 
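
By way of illustration only, the LR Reaction described for Figure 2 may be viewed as an exchange of the segments lying between the paired recombination sites. The following Python sketch is a simplified bookkeeping model and not part of the disclosed methods. The names Plasmid, lr_reaction and survives are hypothetical, ccdB is used merely as an example of the toxic gene, and the attP1/attP2 labels on the byproduct assume the usual attL x attR -> attB + attP outcome.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plasmid:
    """Toy record: a backbone marker plus one cassette flanked by two sites."""
    marker: str  # selection marker on the backbone, e.g. "kan" or "amp"
    left: str    # left recombination site, e.g. "attL1"
    cargo: str   # segment between the two sites
    right: str   # right recombination site, e.g. "attL2"


def lr_reaction(entry: Plasmid, dest: Plasmid):
    """Swap the cassettes of an Entry clone and a Destination Vector.

    Assumed outcome: attL1 x attR1 and attL2 x attR2 leave attB sites on
    the Expression Clone and attP sites on the byproduct.
    """
    if (entry.left, entry.right) != ("attL1", "attL2"):
        raise ValueError("Entry clone must carry attL1 ... attL2")
    if (dest.left, dest.right) != ("attR1", "attR2"):
        raise ValueError("Destination Vector must carry attR1 ... attR2")
    expression = Plasmid(dest.marker, "attB1", entry.cargo, "attB2")
    byproduct = Plasmid(entry.marker, "attP1", dest.cargo, "attP2")
    return expression, byproduct


def survives(plasmid: Plasmid, antibiotic: str) -> bool:
    """In a ccdB-sensitive host, growth needs the marker and no death gene."""
    return plasmid.marker == antibiotic and plasmid.cargo != "ccdB"


entry = Plasmid("kan", "attL1", "gene", "attL2")
dest = Plasmid("amp", "attR1", "ccdB", "attR2")
expression, byproduct = lr_reaction(entry, dest)
```

In this model only the Expression Clone passes survives(..., "amp"): the unreacted Destination Vector carries the death gene, and the byproduct carries the kanamycin marker.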

Figure 3 is a schematic depiction of the cloning of a nucleic acid molecule 
from an Entry clone into multiple types of Destination vectors, to produce a 
variety of Expression Clones. Recombination between a given Entry clone and 
different types of Destination vectors (not shown), via the LR Reaction depicted 
in Figure 2, produces multiple different Expression Clones for use in a variety of 
applications and host cell types. 

Figure 4 is a detailed depiction of the production of Entry Clones via a 
"BP reaction," also referred to herein as an "Entry Reaction" or a "Gateward 
Reaction." In the example shown in this figure, an ampʳ expression vector 
containing a DNA molecule of interest (e.g., a gene) localized between an attB1 
site and an attB2 site is reacted with a kanʳ Donor vector (e.g., an attP vector; 
here, GATEWAY™ pDONR201 (see Figure 49A-C)) containing a toxic or 
"death" gene localized between an attP1 site and an attP2 site, in the presence of 
GATEWAY™ BP Clonase™ Enzyme Mix (a mixture of Int and IHF). After 
incubation at 25 °C for about 60 minutes, the reaction yields a kanʳ Entry clone 
containing the DNA molecule of interest localized between an attL1 site and an 
attL2 site, and an ampʳ byproduct molecule. The Entry clone may then be 
transformed into host cells (e.g., E. coli) and clones containing the Entry clone 
(and therefore the nucleic acid molecule of interest) may be selected by plating the 
cells onto kanamycin-containing media and picking kanʳ colonies. Although this 
figure shows an example of use of a kanʳ Donor vector, it is also possible to use 
Donor vectors containing other selection markers, such as the gentamycin 
resistance or tetracycline resistance markers, as discussed herein. 

Figure 5 is a more detailed schematic depiction of the LR ("Destination") 
reaction (Figure 5A) and the BP ("Entry" or "Gateward") reaction (Figure 5B) of 
the GATEWAY™ Cloning System, showing the reactants, products and 
byproducts of each reaction. 

Figure 6 shows the sequences of the attB1 and attB2 sites flanking a gene 
of interest after subcloning into a Destination Vector to create an Expression 
Clone. 

Figure 7 is a schematic depiction of four ways to make Entry Clones using 
the compositions and methods of the invention: 1. using restriction enzymes and 
ligase; 2. starting with a cDNA library prepared in an attL Entry Vector; 3. using 
an Expression Clone from a library prepared in an attB Expression Vector via the 
BxP reaction; and 4. recombinational cloning of PCR fragments with terminal attB 
sites, via the BxP reaction. Approaches 3 and 4 rely on recombination with a 
Donor vector (here, an attP vector such as pDONR201 (see Figure 49A-C), 
pDONR202 (see Figure 50A-C), pDONR203 (see Figure 51A-C), pDONR204 
(see Figure 52A-C), pDONR205 (see Figure 53A-C), or pDONR206 (see 
Figure 54A-C), for example) that provides an Entry Clone carrying a selection 
marker such as kanʳ, genʳ, tetʳ, or the like. 

Figure 8 is a schematic depiction of cloning of a PCR product by a BxP 
(Entry or Gateward) reaction. A PCR product with 25 bp terminal attB sites 
(plus four Gs) is shown as a substrate for the BxP reaction. Recombination 
between the attB-PCR product of a gene and a Donor vector (which donates an 
Entry Vector that carries kanʳ) results in an Entry Clone of the PCR product. 
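
The primer layout described for Figure 8 (guanines at the 5' end, then a 25 bp attB site, then gene-specific sequence) may be sketched as follows. This is a minimal illustration under stated assumptions: the function name build_attb_primer is hypothetical, the gene-specific region is invented for the example, and a run of 25 N's stands in for the actual attB sequence, which should be taken from Figure 9.

```python
def build_attb_primer(attb_site: str, gene_specific: str, n_guanines: int = 4) -> str:
    """Join 5' guanines, an attB site, and a gene-specific 3' region."""
    allowed = set("ACGTN")
    for name, seq in (("attb_site", attb_site), ("gene_specific", gene_specific)):
        if not seq or set(seq.upper()) - allowed:
            raise ValueError(f"{name} must be a non-empty DNA string")
    if n_guanines < 0:
        raise ValueError("n_guanines must not be negative")
    return "G" * n_guanines + attb_site.upper() + gene_specific.upper()


# Placeholder only: 25 N's stand in for the 25 bp attB sequence of Figure 9.
ATTB_PLACEHOLDER = "N" * 25
# Hypothetical 18-nucleotide gene-specific region, chosen for illustration.
primer = build_attb_primer(ATTB_PLACEHOLDER, "ATGGCTAGCAAAGGAGAA")
```

The n_guanines parameter is exposed so that the number of 5' guanines can be varied in the model.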

Figure 9 is a listing of the nucleotide sequences of the recombination sites 
designated herein as attB1, attB2, attP1, attP2, attL1, attL2, attR1 and attR2. 
Sequences are written conventionally, from 5' to 3'. 

Figures 10-20: The plasmid backbone for all the Entry Vectors depicted 
herein is the same, and is shown in Figure 10A for the Entry Vector pENTR1A. 
For other Entry Vectors shown in Figures 11-20, only the sequences shown in 
Figure "A" for each figure set (i.e., Figure 11A, Figure 12A, etc.) are different 
(within the attL1-attL2 cassettes) from those shown in Figure 10; the plasmid 
backbone is identical. 

Figure 10 is a schematic depiction of the physical map and cloning sites 
(Figure 10A), and the nucleotide sequence (Figure 10B), of the Entry Vector 
pENTR1A. 

Figure 11 is a schematic depiction of the cloning sites (Figure 11A) and 
the nucleotide sequence (Figure 11B) of the Entry Vector pENTR2B. 

Figure 12 is a schematic depiction of the cloning sites (Figure 12A) and 
the nucleotide sequence (Figure 12B) of the Entry Vector pENTR3C. 

Figure 13 is a schematic depiction of the cloning sites (Figure 13A) and 
the nucleotide sequence (Figure 13B) of the Entry Vector pENTR4. 

Figure 14 is a schematic depiction of the cloning sites (Figure 14A) and 
the nucleotide sequence (Figure 14B) of the Entry Vector pENTR5. 

Figure 15 is a schematic depiction of the cloning sites (Figure 15A) and 
the nucleotide sequence (Figure 15B) of the Entry Vector pENTR6. 

Figure 16 is a schematic depiction of the cloning sites (Figure 16A) and 
the nucleotide sequence (Figure 16B) of the Entry Vector pENTR7. 

Figure 17 is a schematic depiction of the cloning sites (Figure 17A) and 
the nucleotide sequence (Figure 17B) of the Entry Vector pENTR8. 

Figure 18 is a schematic depiction of the cloning sites (Figure 18A) and 
the nucleotide sequence (Figure 18B) of the Entry Vector pENTR9. 

Figure 19 is a schematic depiction of the cloning sites (Figure 19A) and 
the nucleotide sequence (Figure 19B) of the Entry Vector pENTR10. 

Figure 20 is a schematic depiction of the cloning sites (Figure 20A) and 
the nucleotide sequence (Figure 20B) of the Entry Vector pENTR11. 

Figure 21 is a schematic depiction of the physical map and the Trc 
expression cassette (Figure 21A) showing the promoter sequences at -35 and at 
-10 from the initiation codon, and the nucleotide sequence (Figure 21B-D), of 
Destination Vector pDEST1. This vector may also be referred to as 
pTrc-DEST1. 

Figure 22 is a schematic depiction of the physical map and the His6 
expression cassette (Figure 22A) showing the promoter sequences at -35 and at 
-10 from the initiation codon, and the nucleotide sequence (Figure 22B-D), of 
Destination Vector pDEST2. This vector may also be referred to as 
pHis6-DEST2. 

Figure 23 is a schematic depiction of the physical map and the GST 
expression cassette (Figure 23A) showing the promoter sequences at -35 and at 
-10 from the initiation codon, and the nucleotide sequence (Figure 23B-D), of 
Destination Vector pDEST3. This vector may also be referred to as pGST- 
DEST3. 

Figure 24 is a schematic depiction of the physical map and the His6-Trx 
expression cassette (Figure 24A) showing the promoter sequences at -35 and at 
-10 from the initiation codon and a TEV protease cleavage site, and the nucleotide 
sequence (Figure 24B-D), of Destination Vector pDEST4. This vector may also 
be referred to as pTrx-DEST4. 

Figure 25 is a schematic depiction of the attR1 and attR2 sites 
(Figure 25A), the physical map (Figure 25B), and the nucleotide sequence 
(Figure 25C-D), of Destination Vector pDEST5. This vector may also be referred 
to as pSPORT(+)-DEST5. 

Figure 26 is a schematic depiction of the attR1 and attR2 sites 
(Figure 26A), the physical map (Figure 26B), and the nucleotide sequence 
(Figure 26C-D), of Destination Vector pDEST6. This vector may also be referred 
to as pSPORT(-)-DEST6. 

Figure 27 is a schematic depiction of the attR1 site, CMV promoter, and 
the physical map (Figure 27A), and the nucleotide sequence (Figure 27B-C), of 
Destination Vector pDEST7. This vector may also be referred to as 
pCMV-DEST7. 

Figure 28 is a schematic depiction of the attR1 site, baculovirus 
polyhedrin promoter, and the physical map (Figure 28A), and the nucleotide 
sequence (Figure 28B-D), of Destination Vector pDEST8. This vector may also 
be referred to as pFastBac-DEST8. 

Figure 29 is a schematic depiction of the attR1 site, Semliki Forest Virus 
promoter, and the physical map (Figure 29A), and the nucleotide sequence (Figure 
29B-E), of Destination Vector pDEST9. This vector may also be referred to as 
pSFV-DEST9. 

Figure 30 is a schematic depiction of the attR1 site, baculovirus 
polyhedrin promoter, His6 fusion domain, and the physical map (Figure 30A), and 
the nucleotide sequence (Figure 30B-D), of Destination Vector pDEST10. This 
vector may also be referred to as pFastBacHT-DEST10. 

Figure 31 is a schematic depiction of the attR1 cassette containing a 
tetracycline-regulated CMV promoter and the physical map (Figure 31A), and the 
nucleotide sequence (Figure 31B-D), of Destination Vector pDEST11. This 
vector may also be referred to as pTet-DEST11. 

Figure 32 is a schematic depiction of the attR1 site, the start of the mRNA 
of the CMV promoter, and the physical map (Figure 32A), and the nucleotide 
sequence (Figure 32B-D), of Destination Vector pDEST12.2. This vector may 
also be referred to as pCMVneo-DEST12, as pCMV-DEST12, or as pDEST12. 

Figure 33 is a schematic depiction of the attR1 site, the λPL promoter, 
and the physical map (Figure 33A), and the nucleotide sequence (Figure 33B-C), 
of Destination Vector pDEST13. This vector may also be referred to as 
pλPL-DEST13. 

Figure 34 is a schematic depiction of the attR1 site, the T7 promoter, and 
the physical map (Figure 34A), and the nucleotide sequence (Figure 34B-D), of 
Destination Vector pDEST14. This vector may also be referred to as 
pPT7-DEST14. 

Figure 35 is a schematic depiction of the attR1 site, the T7 promoter, and 
the N-terminal GST fusion sequence, and the physical map (Figure 35A), and the 
nucleotide sequence (Figure 35B-D), of Destination Vector pDEST15. This 
vector may also be referred to as pT7 GST-DEST15. 

Figure 36 is a schematic depiction of the attR1 site, the T7 promoter, and 
the N-terminal thioredoxin fusion sequence, and the physical map (Figure 36A), 
and the nucleotide sequence (Figure 36B-D), of Destination Vector pDEST16. 
This vector may also be referred to as pT7 Trx-DEST16. 

Figure 37 is a schematic depiction of the attR1 site, the T7 promoter, and 
the N-terminal His6 fusion sequence, and the physical map (Figure 37A), and the 
nucleotide sequence (Figure 37B-D), of Destination Vector pDEST17. This 
vector may also be referred to as pT7 His-DEST17. 

Figure 38 is a schematic depiction of the attR1 site and the p10 
baculovirus promoter, and the physical map (Figure 38A), and the nucleotide 
sequence (Figure 38B-D), of Destination Vector pDEST18. This vector may also 
be referred to as pFBp10-DEST18. 

Figure 39 is a schematic depiction of the attR1 site, and the 39k 
baculovirus promoter, and the physical map (Figure 39A), and the nucleotide 
sequence (Figure 39B-D), of Destination Vector pDEST19. This vector may also 
be referred to as pFB39k-DEST19. 

Figure 40 is a schematic depiction of the attR1 site, the polh baculovirus 
promoter, and the N-terminal GST fusion sequence, and the physical map (Figure 
40A), and the nucleotide sequence (Figure 40B-D), of Destination Vector 
pDEST20. This vector may also be referred to as pFB GST-DEST20. 

Figure 41 is a schematic depiction of a 2-hybrid vector with a 
DNA-binding domain, the attR1 site, and the ADH promoter, and the physical map 
(Figure 41A), and the nucleotide sequence (Figure 41B-E), of Destination Vector 
pDEST21. This vector may also be referred to as pDB Leu-DEST21. 

Figure 42 is a schematic depiction of a 2-hybrid vector with an activation 
domain, the attR1 site, and the ADH promoter, and the physical map 
(Figure 42A), and the nucleotide sequence (Figure 42B-D), of Destination Vector 
pDEST22. This vector may also be referred to as pPC86-DEST22. 

Figure 43 is a schematic depiction of the attR1 and attR2 sites, the T7 
promoter, and the C-terminal His6 fusion sequence, and the physical map 
(Figure 43A), and the nucleotide sequence (Figure 43B-D), of Destination Vector 
pDEST23. This vector may also be referred to as pC-term-His6-DEST23. 

Figure 44 is a schematic depiction of the attR1 and attR2 sites, the T7 
promoter, and the C-terminal GST fusion sequence, and the physical map 
(Figure 44A), and the nucleotide sequence (Figure 44B-D), of Destination Vector 
pDEST24. This vector may also be referred to as pC-term-GST-DEST24. 

Figure 45 is a schematic depiction of the attR1 and attR2 sites, the T7 
promoter, and the C-terminal thioredoxin fusion sequence, and the physical map 
(Figure 45A), and the nucleotide sequence (Figure 45B-D), of Destination Vector 
pDEST25. This vector may also be referred to as pC-term-Trx-DEST25. 

Figure 46 is a schematic depiction of the attR1 site, the CMV promoter, 
and an N-terminal His6 fusion sequence, and the physical map (Figure 46A), and 
the nucleotide sequence (Figure 46B-D), of Destination Vector pDEST26. This 
vector may also be referred to as pCMV-SPneo-His-DEST26. 

Figure 47 is a schematic depiction of the attR1 site, the CMV promoter, 
and an N-terminal GST fusion sequence, and the physical map (Figure 47A), and 
the nucleotide sequence (Figure 47B-D), of Destination Vector pDEST27. This 
vector may also be referred to as pCMV-Spneo-GST-DEST27. 

Figure 48 is a depiction of the physical map (Figure 48A), the cloning sites 
(Figure 48B), and the nucleotide sequence (Figure 48C-D), for the attB cloning 
vector plasmid pEXP501. This vector may also be referred to equivalently herein 
as pCMV-SPORT6, pCMVSPORT6, and pCMVSport6. 

Figure 49 is a depiction of the physical map (Figure 49A), and the 
nucleotide sequence (Figure 49B-C), for the Donor plasmid pDONR201 which 
donates a kanamycin-resistant vector in the BP Reaction. This vector may also 
be referred to as pAttPkanr Donor Plasmid, or as pAttPkan Donor Plasmid. 

Figure 50 is a depiction of the physical map (Figure 50A), and the 
nucleotide sequence (Figure 50B-C), for the Donor plasmid pDONR202 which 
donates a kanamycin-resistant vector in the BP Reaction. 

Figure 51 is a depiction of the physical map (Figure 51A), and the 
nucleotide sequence (Figure 51B-C), for the Donor plasmid pDONR203 which 
donates a kanamycin-resistant vector in the BP Reaction. 

Figure 52 is a depiction of the physical map (Figure 52A), and the 
nucleotide sequence (Figure 52B-C), for the Donor plasmid pDONR204 which 
donates a kanamycin-resistant vector in the BP Reaction. 

Figure 53 is a depiction of the physical map (Figure 53A), and the 
nucleotide sequence (Figure 53B-C), for the Donor plasmid pDONR205 which 
donates a tetracycline-resistant vector in the BP Reaction. 

Figure 54 is a depiction of the physical map (Figure 54A), and the 
nucleotide sequence (Figure 54B-C), for the Donor plasmid pDONR206 which 
donates a gentamycin-resistant vector in the BP Reaction. This vector may also 
be referred to as pENTR22 attP Donor Plasmid, pAttPGenr Donor Plasmid, or 
pAttPgent Donor Plasmid. 

Figure 55 depicts the attB1 site, and the physical map, of an Entry Clone 
(pENTR7) of CAT subcloned into the Destination Vector pDEST2 (Figure 22). 

Figure 56 depicts the DNA components of Reaction B of the one-tube 
BxP reaction described in Example 16, pEZC7102 and attB-tet-PCR. 

Figure 57 is a physical map of the desired product of Reaction B of the 
one-tube BxP reaction described in Example 16, tetx7102. 

Figure 58 is a physical map of the Destination Vector pEZC8402. 

Figure 59 is a physical map of the expected tetʳ subclone product, 
tetx8402, resulting from the LxR Reaction with tetx7102 (Figure 57) plus 
pEZC8402 (Figure 58). 

Figure 60 is a schematic depiction of the bacteriophage lambda 
recombination pathways in E. coli. 

Figure 61 is a schematic depiction of the DNA molecules participating in 
the LR Reaction. Two different co-integrates form during the LR Reaction (only 
one of which is shown here), depending on whether attL1 and attR1 or attL2 and 
attR2 are first to recombine. In one aspect, the invention provides directional 
cloning of a nucleic acid molecule of interest, since the recombination sites react 
with specificity (attL1 reacts with attR1; attL2 with attR2; attB1 with attP1; and 
attB2 with attP2). Thus, positioning of the sites allows construction of desired 
vectors having recombined fragments in the desired orientation. 
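
The pairing rule stated for Figure 61 can be expressed as a small lookup, shown here as a hedged sketch rather than a description of the enzymology. The function name recombine and the table PARTNERS are hypothetical. The model assumes only that a site reacts with a partner of the compatible type bearing the same number, with attB x attP giving attL + attR (BP Reaction) and attL x attR giving attB + attP (LR Reaction).

```python
import re

# Site families that react with each other, and the families they produce.
PARTNERS = {
    frozenset(("B", "P")): ("L", "R"),  # BP Reaction
    frozenset(("L", "R")): ("B", "P"),  # LR Reaction
}

SITE_PATTERN = re.compile(r"att([BPLR])(\d+)$")


def recombine(site_a: str, site_b: str):
    """Return the two product sites, or None if the pair does not react."""
    match_a = SITE_PATTERN.match(site_a)
    match_b = SITE_PATTERN.match(site_b)
    if not match_a or not match_b:
        raise ValueError("expected names such as 'attL1' or 'attB2'")
    family_a, number_a = match_a.groups()
    family_b, number_b = match_b.groups()
    products = PARTNERS.get(frozenset((family_a, family_b)))
    # Sites react only within a compatible family pair and with the same number.
    if products is None or number_a != number_b:
        return None
    return tuple(f"att{family}{number_a}" for family in products)
```

Because attL1 pairs only with attR1 and attL2 only with attR2 in this model, a segment flanked by attL1 and attL2 can enter an attR1/attR2 vector in one orientation only, which is the directional cloning property noted above.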

Figure 62 is a depiction of native and fusion protein expression using the 
recombinational cloning methods and compositions of the invention. In the upper 
figure depicting native protein expression, all of the translational start signals are 
included between the attB1 and attB2 sites; therefore, these signals must be 
present in the starting Entry Clone. The lower figure depicts fusion protein 
expression (here showing expression with both N-terminal and C-terminal fusion 
tags so that ribosomes read through attB1 and attB2 to create the fusion protein). 
Unlike native protein expression vectors, N-terminal fusion vectors have their 
translational start signals upstream of the attB1 site. 

Figure 63 is a schematic depiction of three GATEWAY™ Cloning System 
cassettes. Three blunt-ended cassettes are depicted which convert standard 
expression vectors to Destination Vectors. Each of the depicted cassettes 
provides amino-terminal fusions in one of three possible reading frames, and each 
has a distinctive restriction cleavage site as shown. 

Figure 64 shows the physical maps of plasmids containing three attR 
reading frame cassettes, pEZC15101 (reading frame A; Figure 64A), pEZC15102 
(reading frame B; Figure 64B), and pEZC15103 (reading frame C; Figure 64C). 

Figure 65 depicts the attB primers used for amplifying the tetʳ and ampʳ 
genes from pBR322 by the cloning methods of the invention. 

Figure 66 is a table listing the results of recombinational cloning of the tetʳ 
and ampʳ PCR products made using the primers shown in Figure 65. 

Figure 67 is a graph showing the effect of the number of guanines (G's) 
contained on the 5' end of the PCR primers on the cloning efficiency of PCR 
products. It is noted, however, that other nucleotides besides guanine (including 
A, T, C, U or combinations thereof) may be used as 5' extensions on the PCR 
primers to enhance cloning efficiency of PCR products. 

Figure 68 is a graph showing a titration of various amounts of attP and 
attB reactants in the BxP reaction, and the effects on cloning efficiency of PCR 
products. 

Figure 69 is a series of graphs showing the effects of various weights 
(Figure 69A) or moles (Figure 69B) of a 256 bp PCR product on formation of 
colonies, and on efficiency of cloning of the 256 bp PCR product into a Donor 
Vector (Figure 69C). 

Figure 70 is a series of graphs showing the effects of various weights 
(Figure 70A) or moles (Figure 70B) of a 1 kb PCR product on formation of 
colonies, and on efficiency of cloning of the 1 kb PCR product into a Donor 
Vector (Figure 70C). 

Figure 71 is a series of graphs showing the effects of various weights 
(Figure 71A) or moles (Figure 71B) of a 1.4 kb PCR product on formation of 
colonies, and on efficiency of cloning of the 1.4 kb PCR product into a Donor 
Vector (Figure 71C). 

Figure 72 is a series of graphs showing the effects of various weights 
(Figure 72A) or moles (Figure 72B) of a 3.4 kb PCR product on formation of 
colonies, and on efficiency of cloning of the 3.4 kb PCR product into a Donor 
Vector (Figure 72C). 

Figure 73 is a series of graphs showing the effects of various weights 
(Figure 73A) or moles (Figure 73B) of a 4.6 kb PCR product on formation of 
colonies, and on efficiency of cloning of the 4.6 kb PCR product into a Donor 
Vector (Figure 73C). 
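
Figures 69-73 express the amount of PCR product both by weight and by moles. The relationship between the two may be sketched as below. This is an illustrative conversion only: it assumes an average mass of about 660 g/mol per base pair of double-stranded DNA, a common approximation that is not stated in this specification, and the function names ng_to_fmol and fmol_to_ng are hypothetical.

```python
AVG_BP_MASS = 660.0  # assumed average g/mol per base pair of double-stranded DNA


def ng_to_fmol(mass_ng: float, length_bp: int) -> float:
    """Convert a mass of double-stranded DNA (ng) to femtomoles."""
    if length_bp <= 0:
        raise ValueError("length_bp must be positive")
    grams = mass_ng * 1e-9
    moles = grams / (length_bp * AVG_BP_MASS)
    return moles * 1e15


def fmol_to_ng(amount_fmol: float, length_bp: int) -> float:
    """Convert femtomoles of double-stranded DNA to nanograms."""
    if length_bp <= 0:
        raise ValueError("length_bp must be positive")
    return amount_fmol * 1e-15 * length_bp * AVG_BP_MASS * 1e9
```

Under this approximation, equal weights of a short and a long PCR product contain different numbers of molecules, which is one reason for plotting the same titration on both a weight axis and a molar axis.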

Figure 74 is a photograph of an ethidium bromide-stained gel of a titration 
of a 6.9 kb PCR product in a BxP reaction. 

Figure 75 is a graph showing the effects of various amounts of a 10.1 kb 
PCR product on formation of colonies upon cloning of the 10.1 kb PCR product 
into a Donor Vector. 

Figure 76 is a photograph of an ethidium bromide-stained gel of a titration 
of a 10.1 kb PCR product in a BxP reaction. 

Figure 77 is a table summarizing the results of the PCR product cloning 
efficiency experiments depicted in Figures 69-74, for PCR fragments ranging in 
size from 0.256 kb to 6.9 kb. 

Figure 78 is a depiction of the sequences at the ends of attR Cassettes. 
Sequences contributed by the Cmʳ-ccdB cassette are shown, including the outer 
ends of the flanking attR sites (boxed). The staggered cleavage sites for Int are 
indicated in the boxed regions. Following recombination with an Entry Clone, 
only the outer sequences in attR sites contribute to the resulting attB sites in the 
Expression Clone. The underlined sequences at both ends dictate the different 
reading frames (reading frames A, B, or C, with two alternative reading frame C 
cassettes depicted) for fusion proteins. 

Figure 79 is a depiction of several different attR cassettes (in reading 
frames A, B, or C) which may provide fusion codons at the amino-terminus of the 
encoded protein. 

Figure 80 illustrates the single-cutting restriction sites in an attR reading 
frame A cassette of the invention. 

Figure 81 illustrates the single-cutting restriction sites in an attR reading 
frame B cassette of the invention. 

Figure 82 illustrates the single-cutting restriction sites in two alternative 
attR reading frame C cassettes of the invention (Figures 82A and 82B) depicted 
in Figure 78. 

Figure 83 shows the physical map (Figure 83A), and the nucleotide 
sequence (Figure 83B-C), for an attR reading frame C parent plasmid prfC 
Parent III, which contains an attR reading frame C cassette of the invention 
(alternative A in Figures 78 and 82). 

Figure 84 is a physical map of plasmid pEZC1301. 

Figure 85 is a physical map of plasmid pEZC1313. 

Figure 86 is a physical map of plasmid pEZ14032. 

Figure 87 is a physical map of plasmid pMAB58. 

Figure 88 is a physical map of plasmid pMAB62. 

Figure 89 is a depiction of a synthesis reaction using two pairs of 
homologous primers of the invention. 

Figure 90 is a schematic depiction of the physical map (Figure 90A), and 
the nucleotide sequence (Figure 90B-D), of Destination Vector pDEST28. 

Figure 91 is a schematic depiction of the physical map (Figure 91A), and 
the nucleotide sequence (Figure 91B-D), of Destination Vector pDEST29. 

Figure 92 is a schematic depiction of the physical map (Figure 92A), and 
the nucleotide sequence (Figure 92B-D), of Destination Vector pDEST30. 

Figure 93 is a schematic depiction of the physical map (Figure 93A), and 
the nucleotide sequence (Figure 93B-D), of Destination Vector pDEST31. 

Figure 94 is a schematic depiction of the physical map (Figure 94A), and 
the nucleotide sequence (Figure 94B-E), of Destination Vector pDEST32. 

Figure 95 is a schematic depiction of the physical map (Figure 95A), and 
the nucleotide sequence (Figure 95B-D), of Destination Vector pDEST33. 

Figure 96 is a schematic depiction of the physical map (Figure 96A), and 
the nucleotide sequence (Figure 96B-D), of Destination Vector pDEST34. 

Figure 97 is a depiction of the physical map (Figure 97A), and the 
nucleotide sequence (Figure 97B-C), for the Donor plasmid pDONR207 which 
donates a gentamycin-resistant vector in the BP Reaction. 

Figure 98 is a schematic depiction of the physical map (Figure 98A), and 
the nucleotide sequence (Figure 98B-D), of the 2-hybrid vector pMAB85. 

Figure 99 is a schematic depiction of the physical map (Figure 99A), and 
the nucleotide sequence (Figure 99B-D), of the 2-hybrid vector pMAB86. 



DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

In the description that follows, a number of terms used in recombinant 
DNA technology are utilized extensively. In order to provide a clear and 
consistent understanding of the specification and claims, including the scope to be 
given such terms, the following definitions are provided. 

Byproduct: is a daughter molecule (a new clone produced after the 
second recombination event during the recombinational cloning process) lacking 
the segment which is desired to be cloned or subcloned. 

Cointegrate: is at least one recombination intermediate nucleic acid 
molecule of the present invention that contains both parental (starting) molecules. 
It will usually be linear. In some embodiments it can be circular. RNA and 
polypeptides may be expressed from cointegrates using an appropriate host cell 
strain, for example E. coli DB3.1 (particularly E. coli LIBRARY EFFICIENCY®
DB3.1™ Competent Cells), and selecting for both selection markers found on the
cointegrate molecule. 

Host: is any prokaryotic or eukaryotic organism that can be a recipient of 
the recombinational cloning Product, vector, or nucleic acid molecule of the 
invention. A "host," as the term is used herein, includes prokaryotic or eukaryotic
organisms that can be genetically engineered. For examples of such hosts, see
Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor
Laboratory, Cold Spring Harbor, New York (1982). 

Insert or Inserts: include the desired nucleic acid segment or a population
of nucleic acid segments (segments of Figure 1) which may be manipulated by 
the methods of the present invention. Thus, the terms Insert(s) are meant to 
include a particular nucleic acid (preferably DNA) segment or a population of 
segments. Such Insert(s) can comprise one or more nucleic acid molecules. 

Insert Donor: is one of the two parental nucleic acid molecules (e.g. 
RNA or DNA) of the present invention which carries the Insert. The Insert Donor 
molecule comprises the Insert flanked on both sides with recombination sites. 
The Insert Donor can be linear or circular. In one embodiment of the invention, 
the Insert Donor is a circular DNA molecule and further comprises a cloning 
vector sequence outside of the recombination signals (see Figure 1). When a 
population of Inserts or population of nucleic acid segments are used to make the 
Insert Donor, a population of Insert Donors results and may be used in accordance 
with the invention. Examples of such Insert Donor molecules are GATEWAY™ 
Entry Vectors, which include but are not limited to those Entry Vectors depicted 
in Figures 10-20, as well as other vectors comprising a gene of interest flanked by 
one or more attL sites (e.g., attL1, attL2, etc.), or by one or more attB sites (e.g.,
attB1, attB2, etc.) for the production of library clones.

Product: is one of the desired daughter molecules comprising the A and 
D sequences which is produced after the second recombination event during the 
recombinational cloning process (see Figure 1). The Product contains the nucleic
acid which was to be cloned or subcloned. In accordance with the invention,
when a population of Insert Donors is used, the resulting population of Product
molecules will contain all or a portion of the population of Inserts of the Insert
Donors and preferably will contain a representative population of the original 
molecules of the Insert Donors. 

Promoter: is a DNA sequence generally described as the 5'-region of a 
gene, located proximal to the start codon. The transcription of an adjacent DNA 
segment is initiated at the promoter region. A repressible promoter's rate of 
transcription decreases in response to a repressing agent. An inducible promoter's 
rate of transcription increases in response to an inducing agent. A constitutive 
promoter's rate of transcription is not specifically regulated, though it can vary 
under the influence of general metabolic conditions. 

Recognition sequence: Recognition sequences are particular sequences 
which a protein, chemical compound, DNA, or RNA molecule (e.g., restriction
endonuclease, a modification methylase, or a recombinase) recognizes and binds. 
In the present invention, a recognition sequence will usually refer to a 
recombination site. For example, the recognition sequence for Cre recombinase 
is loxP which is a 34 base pair sequence comprised of two 13 base pair inverted 
repeats (serving as the recombinase binding sites) flanking an 8 base pair core 
sequence. See Figure 1 of Sauer, B., Current Opinion in Biotechnology 
5:521-527 (1994). Other examples of recognition sequences are the attB, attP, 
attL, and attR sequences which are recognized by the recombinase enzyme 
λ Integrase. attB is an approximately 25 base pair sequence containing two 9 base
pair core-type Int binding sites and a 7 base pair overlap region. attP is an 
approximately 240 base pair sequence containing core-type Int binding sites and 
arm-type Int binding sites as well as sites for auxiliary proteins integration host 
factor (IHF), FIS and excisionase (Xis). See Landy, Current Opinion in 
Biotechnology 3:699-707 (1993). Such sites may also be engineered according 
to the present invention to enhance production of products in the methods of the 
invention. When such engineered sites lack the P1 or H1 domains to make the
recombination reactions irreversible (e.g., attR or attP), such sites may be 
designated attR' or attP' to show that the domains of these sites have been 
modified in some way. 
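
The loxP architecture described above (two 13 base pair inverted repeats flanking an 8 base pair core) can be illustrated with a short sketch. The sequence used here is the commonly published wild-type loxP sequence and is an assumption for illustration; it is not taken from this specification, and the function names are hypothetical.

```python
# Illustrative sketch only. The loxP sequence below is the commonly published
# wild-type sequence (an assumption, not quoted from this specification).
LOXP = "ATAACTTCGTATA" + "ATGTATGC" + "TATACGAAGTTAT"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def split_site(site: str, arm: int = 13, core: int = 8):
    """Split a site into (left arm, core, right arm) by length alone."""
    if len(site) != 2 * arm + core:
        raise ValueError("site length does not match arm/core layout")
    return site[:arm], site[arm:arm + core], site[arm + core:]


def has_inverted_repeat_arms(site: str) -> bool:
    """True if the right arm is the reverse complement of the left arm."""
    left, _core, right = split_site(site)
    return right == reverse_complement(left)


if __name__ == "__main__":
    print(len(LOXP), split_site(LOXP), has_inverted_repeat_arms(LOXP))
```

The check treats the site purely as a string; it says nothing about whether a recombinase would actually bind or cleave a given sequence.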




Recombination proteins: include excisive or integrative proteins, 
enzymes, co-factors or associated proteins that are involved in recombination 
reactions involving one or more recombination sites, which may be wild-type 
proteins (See Landy, Current Opinion in Biotechnology 3:699-707 (1993)), or 
mutants, derivatives (e.g., fusion proteins containing the recombination protein 
sequences or fragments thereof), fragments, and variants thereof.

Recombination site: is a recognition sequence on a DNA molecule 
participating in an integration/recombination reaction by the recombinational 
cloning methods of the invention. Recombination sites are discrete sections or 
segments of DNA on the participating nucleic acid molecules that are recognized 
and bound by a site-specific recombination protein during the initial stages of 
integration or recombination. For example, the recombination site for Cre 
recombinase is loxP which is a 34 base pair sequence comprised of two 13 base 
pair inverted repeats (serving as the recombinase binding sites) flanking an 8 base 
pair core sequence. See Figure 1 of Sauer, B., Curr. Opin. Biotech. 5:521-527 
(1994). Other examples of recognition sequences include the attB, attP, attL, and
attR sequences described herein, and mutants, fragments, variants and derivatives
thereof, which are recognized by the recombination protein λ Int and by the
auxiliary proteins integration host factor (IHF), FIS and excisionase (Xis). See
Landy, Curr. Opin. Biotech. 3:699-707 (1993).

Recombinational Cloning: is a method described herein, whereby 
segments of nucleic acid molecules or populations of such molecules are 
exchanged, inserted, replaced, substituted or modified, in vitro or in vivo. By 
"in vitro" and "in vivo" herein is meant recombinational cloning that is carried out 
outside of host cells (e.g., in cell-free systems) or inside of host cells (e.g., using 
recombination proteins expressed by host cells), respectively. 

Repression cassette: is a nucleic acid segment that contains a repressor 
of a Selectable marker present in the subcloning vector.

Selectable marker: is a DNA segment that allows one to select for or 
against a molecule (e.g., a replicon) or a cell that contains it, often under particular
conditions. These markers can encode an activity, such as, but not limited to,
production of RNA, peptide, or protein, or can provide a binding site for RNA,
peptides, proteins, inorganic and organic compounds or compositions and the like.
Examples of Selectable markers include but are not limited to: (1) DNA segments
that encode products which provide resistance against otherwise toxic compounds
(e.g., antibiotics); (2) DNA segments that encode products which are otherwise
lacking in the recipient cell (e.g., tRNA genes, auxotrophic markers); (3) DNA
segments that encode products which suppress the activity of a gene product; (4)
DNA segments that encode products which can be readily identified (e.g.,
phenotypic markers such as β-galactosidase, green fluorescent protein (GFP), and
cell surface proteins); (5) DNA segments that bind products which are otherwise
detrimental to cell survival and/or function; (6) DNA segments that otherwise
inhibit the activity of any of the DNA segments described in Nos. 1-5 above (e.g.,
antisense oligonucleotides); (7) DNA segments that bind products that modify a
substrate (e.g., restriction endonucleases); (8) DNA segments that can be used to
isolate or identify a desired molecule (e.g., specific protein binding sites); (9) DNA
segments that encode a specific nucleotide sequence which can be otherwise non-
functional (e.g., for PCR amplification of subpopulations of molecules); (10) DNA
segments, which when absent, directly or indirectly confer resistance or sensitivity 
to particular compounds; (11) DNA segments that encode products which are 
toxic in recipient cells; (12) DNA segments that inhibit replication, partition or 
heritability of nucleic acid molecules that contain them; and/or (13) DNA 
segments that encode conditional replication functions, e.g., replication in certain 
hosts or host cell strains or under certain environmental conditions (e.g., 
temperature, nutritional conditions, etc.). 

Selection scheme: is any method which allows selection, enrichment, or 
identification of a desired Product or Product(s) from a mixture containing an 
Entry Clone or Vector, a Destination Vector, a Donor Vector, an Expression 
Clone or Vector, any intermediates (e.g. a Cointegrate or a replicon), and/or 
Byproducts. The selection schemes of one preferred embodiment have at least 
two components that are either linked or unlinked during recombinational cloning. 
One component is a Selectable marker. The other component controls the 
expression in vitro or in vivo of the Selectable marker, or survival of the cell (or
the nucleic acid molecule, e.g., a replicon) harboring the plasmid carrying the
Selectable marker. Generally, this controlling element will be a repressor or 
inducer of the Selectable marker, but other means for controlling expression or 
activity of the Selectable marker can be used. Whether a repressor or activator 
is used will depend on whether the marker is for a positive or negative selection, 
and the exact arrangement of the various DNA segments, as will be readily 
apparent to those skilled in the art. A preferred requirement is that the selection 
scheme results in selection of or enrichment for only one or more desired 
Products. As defined herein, selecting for a DNA molecule includes (a) selecting 
or enriching for the presence of the desired DNA molecule, and (b) selecting or 
enriching against the presence of DNA molecules that are not the desired DNA 
molecule. 

In one embodiment, the selection schemes (which can be carried out in 
reverse) will take one of three forms, which will be discussed in terms of Figure 1.
The first, exemplified herein with a Selectable marker and a repressor therefor,
selects for molecules having segment D and lacking segment C. The second
selects against molecules having segment C and for molecules having segment D.
Possible embodiments of the second form would have a DNA segment carrying
a gene toxic to cells into which the in vitro reaction products are to be introduced.
A toxic gene can be a DNA that is expressed as a toxic gene product (a toxic
protein or RNA), or can be toxic in and of itself. (In the latter case, the toxic gene
is understood to carry its classical definition of "heritable trait".) 

Examples of such toxic gene products are well known in the art, and 
include, but are not limited to, restriction endonucleases (e.g., DpnI), apoptosis-
related genes (e.g., ASK1 or members of the bcl-2/ced-9 family), retroviral genes
including those of the human immunodeficiency virus (HIV), defensins such as
NP-1, inverted repeats or paired palindromic DNA sequences, bacteriophage lytic
genes such as those from ΦX174 or bacteriophage T4; antibiotic sensitivity genes
such as rpsL, antimicrobial sensitivity genes such as pheS, plasmid killer genes,
eukaryotic transcriptional vector genes that produce a gene product toxic to
bacteria, such as GATA-1, and genes that kill hosts in the absence of a
suppressing function, e.g., kicB, ccdB, ΦX174 E (Liu, Q. et al., Curr. Biol.
8:1300-1309 (1998)), and other genes that negatively affect replicon stability
and/or replication. A toxic gene can alternatively be selectable in vitro, e.g., a 
restriction site. 

Many genes coding for restriction endonucleases operably linked to 
inducible promoters are known, and may be used in the present invention. See, 
e.g., U.S. Patent Nos. 4,960,707 (DpnI and DpnII); 5,000,333, 5,082,784 and
5,192,675 (KpnI); 5,147,800 (NgoAIII and NgoAI); 5,179,015 (FspI and HaeIII);
5,200,333 (HaeII and TaqI); 5,248,605 (HpaII); 5,312,746 (ClaI); 5,231,021 and
5,304,480 (XhoI and XhoII); 5,334,526 (AluI); 5,470,740 (NsiI); 5,534,428
(SstI/SacI); 5,202,248 (NcoI); 5,139,942 (NdeI); and 5,098,839 (PacI). See also
Wilson, G.G., Nucl. Acids Res. 19:2539-2566 (1991); and Lunnen, K.D., et al.,
Gene 74:25-32 (1988).

In the second form, segment D carries a Selectable marker. The toxic gene 
would eliminate transformants harboring the Vector Donor, Cointegrate, and 
Byproduct molecules, while the Selectable marker can be used to select for cells 
containing the Product and against cells harboring only the Insert Donor. 

The third form selects for cells that have both segments A and D in cis on
the same molecule, but not for cells that have both segments in trans on different
molecules. This could be embodied by a Selectable marker that is split into two
inactive fragments, one each on segments A and D.

The fragments are so arranged relative to the recombination sites that 
when the segments are brought together by the recombination event, they 
reconstitute a functional Selectable marker. For example, the recombinational 
event can link a promoter with a structural nucleic acid molecule (e.g., a gene), 
can link two fragments of a structural nucleic acid molecule, or can link nucleic 
acid molecules that encode a heterodimeric gene product needed for survival, or 
can link portions of a replicon. 

Site-specific recombinase: is a type of recombinase which typically has 
at least the following four activities (or combinations thereof): (1) recognition of 
one or two specific nucleic acid sequences; (2) cleavage of said sequence or 
sequences; (3) topoisomerase activity involved in strand exchange; and (4) ligase
activity to reseal the cleaved strands of nucleic acid. See Sauer, B., Current
Opinion in Biotechnology 5:521-527 (1994). Conservative site-specific
recombination is distinguished from homologous recombination and transposition 
by a high degree of sequence specificity for both partners. The strand exchange 
mechanism involves the cleavage and rejoining of specific DNA sequences in the 
absence of DNA synthesis (Landy, A. (1989) Ann. Rev. Biochem. 58:913-949).

Subcloning vector: is a cloning vector comprising a circular or linear 
nucleic acid molecule which includes preferably an appropriate replicon. In the 
present invention, the subcloning vector (segment D in Figure 1) can also contain 
functional and/or regulatory elements that are desired to be incorporated into the 
final product to act upon or with the cloned DNA Insert (segment A in Figure 1). 
The subcloning vector can also contain a Selectable marker (preferably DNA). 

Vector: is a nucleic acid molecule (preferably DNA) that provides a useful 
biological or biochemical property to an Insert. Examples include plasmids, 
phages, autonomously replicating sequences (ARS), centromeres, and other 
sequences which are able to replicate or be replicated in vitro or in a host cell, or 
to convey a desired nucleic acid segment to a desired location within a host cell. 
A Vector can have one or more restriction endonuclease recognition sites at which 
the sequences can be cut in a determinable fashion without loss of an essential 
biological function of the vector, and into which a nucleic acid fragment can be 
spliced in order to bring about its replication and cloning. Vectors can further 
provide primer sites, e.g., for PCR, transcriptional and/or translational initiation 
and/or regulation sites, recombinational signals, replicons, Selectable markers, etc. 
Clearly, methods of inserting a desired nucleic acid fragment which do not require 
the use of homologous recombination, transpositions or restriction enzymes (such 
as, but not limited to, UDG cloning of PCR fragments (U.S. Patent No. 
5,334,575, entirely incorporated herein by reference), T:A cloning, and the like) 
can also be applied to clone a fragment into a cloning vector to be used according 
to the present invention. The cloning vector can further contain one or more 
selectable markers suitable for use in the identification of cells transformed with 
the cloning vector.

Vector Donor: is one of the two parental nucleic acid molecules (e.g.
RNA or DNA) of the present invention which carries the DNA segments
comprising the DNA vector which is to become part of the desired Product. The
Vector Donor comprises a subcloning vector D (or it can be called the cloning
vector if the Insert Donor does not already contain a cloning vector (e.g., for PCR
fragments containing attB sites; see below)) and a segment C flanked by 
recombination sites (see Figure 1). Segments C and/or D can contain elements 
that contribute to selection for the desired Product daughter molecule, as 
described above for selection schemes. The recombination signals can be the same 
or different, and can be acted upon by the same or different recombinases. In 
addition, the Vector Donor can be linear or circular. Examples of such Vector 
Donor molecules include GATEWAY™ Destination Vectors, which include but 
are not limited to those Destination Vectors depicted in Figures 21-47 and 90-96. 

Primer: refers to a single stranded or double stranded oligonucleotide that 
is extended by covalent bonding of nucleotide monomers during amplification or 
polymerization of a nucleic acid molecule (e.g. a DNA molecule). In a preferred 
aspect, a primer comprises one or more recombination sites or portions of such 
recombination sites. Portions of recombination sites comprise at least 2 bases (or 
basepairs, abbreviated herein as "bp"), at least 5-200 bases, at least 10-100 bases, 
at least 15-75 bases, at least 15-50 bases, at least 15-25 bases, or at least 16-25 
bases, of the recombination sites of interest, as described in further detail below 
and in the Examples. When using portions of recombination sites, the missing 
portion of the recombination site may be provided as a template by the newly 
synthesized nucleic acid molecule. Such recombination sites may be located 
within and/or at one or both termini of the primer. Preferably, additional 
sequences are added to the primer adjacent to the recombination site(s) to enhance 
or improve recombination and/or to stabilize the recombination site during 
recombination. Such stabilization sequences may be any sequences (preferably 
G/C rich sequences) of any length. Preferably, such sequences range in size from 
1 to about 1000 bases, 1 to about 500 bases, and 1 to about 100 bases, 1 to about 
60 bases, 1 to about 25, 1 to about 10, 2 to about 10 and preferably about 4 bases.
Preferably, such sequences are greater than 1 base in length and preferably greater
than 2 bases in length. 
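
As a minimal sketch of the primer layout described above, the following assembles a primer from a short G/C-rich stabilization sequence, a recombination site, and a template-specific portion. The attB1 sequence shown is the commonly published 25-base site and the template-specific sequence is invented for illustration; both are assumptions rather than sequences quoted from this section, and the helper name is hypothetical.

```python
# Illustrative sketch only. ATTB1 is the commonly published 25-base attB1 site
# (an assumption here); GENE_SPECIFIC is an invented example sequence.
ATTB1 = "ACAAGTTTGTACAAAAAAGCAGGCT"
GENE_SPECIFIC = "ATGGCTAGCAAAGGAGAAG"


def build_adapter_primer(site: str, gene_specific: str,
                         stabilizer: str = "GGGG") -> str:
    """Return 5'-stabilizer + recombination site + template-specific part-3'."""
    for label, part in (("stabilizer", stabilizer), ("site", site),
                        ("gene_specific", gene_specific)):
        if not part or set(part) - set("ACGT"):
            raise ValueError(f"{label} must be a non-empty A/C/G/T string")
    return stabilizer + site + gene_specific


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    primer = build_adapter_primer(ATTB1, GENE_SPECIFIC)
    print(primer, len(primer), gc_fraction(primer[:4]))
```

The default four-base stabilizer mirrors the preferred length of about 4 bases stated above; other lengths within the stated ranges could be passed through the `stabilizer` argument.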

Template: refers to double stranded or single stranded nucleic acid 
molecules which are to be amplified, synthesized or sequenced. In the case of 
double stranded molecules, denaturation of its strands to form a first and a second 
strand is preferably performed before these molecules will be amplified, 
synthesized or sequenced, or the double stranded molecule may be used directly 
as a template. For single stranded templates, a primer complementary to a portion 
of the template is hybridized under appropriate conditions and one or more 
polypeptides having polymerase activity (e.g. DNA polymerases and/or reverse 
transcriptases) may then synthesize a nucleic acid molecule complementary to all 
or a portion of said template. Alternatively, for double stranded templates, one 
or more promoters may be used in combination with one or more polymerases to 
make nucleic acid molecules complementary to all or a portion of the template. 
The newly synthesized molecules, according to the invention, may be equal or 
shorter in length than the original template. Additionally, a population of nucleic 
acid templates may be used during synthesis or amplification to produce a 
population of nucleic acid molecules typically representative of the original 
template population. 

Adapter: is an oligonucleotide or nucleic acid fragment or segment 
(preferably DNA) which comprises one or more recombination sites (or portions 
of such recombination sites) which in accordance with the invention can be added 
to a circular or linear Insert Donor molecule as well as other nucleic acid 
molecules described herein. When using portions of recombination sites, the 
missing portion may be provided by the Insert Donor molecule. Such adapters 
may be added at any location within a circular or linear molecule, although the 
adapters are preferably added at or near one or both termini of a linear molecule. 
Preferably, adapters are positioned to be located on both sides (flanking) a 
particular nucleic acid molecule of interest. In accordance with the invention, 
adapters may be added to nucleic acid molecules of interest by standard 
recombinant techniques (e.g. restriction digest and ligation). For example, 
adapters may be added to a circular molecule by first digesting the molecule with
an appropriate restriction enzyme, adding the adapter at the cleavage site and
reforming the circular molecule which contains the adapter(s) at the site of 
cleavage. In other aspects, adapters may be added by homologous recombination, 
by integration of RNA molecules, and the like. Alternatively, adapters may be 
ligated directly to one or more and preferably both termini of a linear molecule 
thereby resulting in linear molecule(s) having adapters at one or both termini. In 
one aspect of the invention, adapters may be added to a population of linear 
molecules (e.g., a cDNA library or genomic DNA which has been cleaved or
digested) to form a population of linear molecules containing adapters at one and
preferably both termini of all or a substantial portion of said population.

Adapter-Primer: is a primer molecule which comprises one or more
recombination sites (or portions of such recombination sites) which in accordance 
with the invention can be added to a circular or linear nucleic acid molecule 
described herein. When using portions of recombination sites, the missing portion 
may be provided by a nucleic acid molecule (e.g., an adapter) of the invention. 
Such adapter-primers may be added at any location within a circular or linear 
molecule, although the adapter-primers are preferably added at or near one or both 
termini of a linear molecule. Examples of such adapter-primers and the use 
thereof in accordance with the methods of the invention are shown in Example 25 
herein. Such adapter-primers may be used to add one or more recombination sites 
or portions thereof to circular or linear nucleic acid molecules in a variety of 
contexts and by a variety of techniques, including but not limited to amplification 
(e.g., PCR), ligation (e.g., enzymatic or chemical/synthetic ligation), 
recombination (e.g., homologous or non-homologous (illegitimate) recombination)
and the like. 

Library: refers to a collection of nucleic acid molecules (circular or 
linear). In one embodiment, a library may comprise a plurality (i.e., two or more) 
of DNA molecules, which may or may not be from a common source organism, 
organ, tissue, or cell. In another embodiment, a library is representative of all or 
a portion or a significant portion of the DNA content of an organism (a "genomic" 
library), or a set of nucleic acid molecules representative of all or a portion or a 
significant portion of the expressed nucleic acid molecules (a cDNA library) in a
cell, tissue, organ or organism. A library may also comprise random sequences
made by de novo synthesis, mutagenesis of one or more sequences and the like. 
Such libraries may or may not be contained in one or more vectors. 

Amplification: refers to any in vitro method for increasing a number of 
copies of a nucleotide sequence with the use of a polymerase. Nucleic acid 
amplification results in the incorporation of nucleotides into a DNA and/or RNA 
molecule or primer thereby forming a new molecule complementary to a template. 
The formed nucleic acid molecule and its template can be used as templates to 
synthesize additional nucleic acid molecules. As used herein, one amplification 
reaction may consist of many rounds of replication. DNA amplification reactions 
include, for example, polymerase chain reaction (PCR). One PCR reaction may 
consist of 5-100 "cycles" of denaturation and synthesis of a DNA molecule. 
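
The effect of cycle number on copy number can be illustrated with the usual idealized model, in which each cycle multiplies the number of copies by (1 + efficiency). This model and the function name are assumptions for illustration and are not taken from this specification; real reactions plateau well before the upper cycle counts mentioned above.

```python
def copies_after_cycles(initial_copies: int, cycles: int,
                        efficiency: float = 1.0) -> float:
    """Idealized PCR yield: initial_copies * (1 + efficiency) ** cycles.

    efficiency = 1.0 means perfect doubling per cycle; lower values model
    incomplete extension. Plateau effects are deliberately ignored.
    """
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    return initial_copies * (1.0 + efficiency) ** cycles


if __name__ == "__main__":
    for n in (5, 20, 30):
        print(n, copies_after_cycles(1, n), copies_after_cycles(1, n, 0.9))
```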

Oligonucleotide: refers to a synthetic or natural molecule comprising a 
covalently linked sequence of nucleotides which are joined by a phosphodiester 
bond between the 3' position of the deoxyribose or ribose of one nucleotide and
the 5' position of the deoxyribose or ribose of the adjacent nucleotide. This term 
may be used interchangeably herein with the terms "nucleic acid molecule" and 
"polynucleotide," without any of these terms necessarily indicating any particular 
length of the nucleic acid molecule to which the term specifically refers. 

Nucleotide: refers to a base-sugar-phosphate combination. Nucleotides 
are monomeric units of a nucleic acid molecule (DNA and RNA). The term 
nucleotide includes ribonucleoside triphosphates ATP, UTP, CTP, GTP and
deoxyribonucleoside triphosphates such as dATP, dCTP, dITP, dUTP, dGTP,
dTTP, or derivatives thereof. Such derivatives include, for example, [αS]dATP,
7-deaza-dGTP and 7-deaza-dATP. The term nucleotide as used herein also refers
to dideoxyribonucleoside triphosphates (ddNTPs) and their derivatives. Illustrated
examples of dideoxyribonucleoside triphosphates include, but are not limited to,
ddATP, ddCTP, ddGTP, ddITP, and ddTTP. According to the present invention,
a "nucleotide" may be unlabeled or detectably labeled by well known techniques. 
Detectable labels include, for example, radioactive isotopes, fluorescent labels, 
chemiluminescent labels, bioluminescent labels and enzyme labels.

Hybridization: The terms "hybridization" and "hybridizing" refer to base
pairing of two complementary single-stranded nucleic acid molecules (RNA and/or 
DNA) to give a double stranded molecule. As used herein, two nucleic acid 
molecules may be hybridized, although the base pairing is not completely 
complementary. Accordingly, mismatched bases do not prevent hybridization of 
two nucleic acid molecules provided that appropriate conditions, well known in 
the art, are used. In some aspects, hybridization is said to be under "stringent 
conditions." By "stringent conditions" as used herein is meant overnight 
incubation at 42°C in a solution comprising: 50% formamide, 5x SSC (150
mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x
Denhardt's solution, 10% dextran sulfate, and 20 μg/ml denatured, sheared salmon
sperm DNA, followed by washing the filters in 0.1x SSC at about 65°C.

Other terms used in the fields of recombinant DNA technology and 
molecular and cell biology as used herein will be generally understood by one of 
ordinary skill in the applicable arts. 

Overview 

Two reactions constitute the recombinational cloning system of the present 
invention, referred to herein as the "GATEWAY™ Cloning System," as depicted 
generally in Figure 1. The first of these reactions, the LR Reaction (Figure 2),
which may also be referred to interchangeably herein as the Destination 
Reaction, is the main pathway of this system. The LR Reaction is a 
recombination reaction between an Entry vector or clone and a Destination 
Vector, mediated by a cocktail of recombination proteins such as the 
GATEWAY™ LR Clonase™ Enzyme Mix described herein. This reaction 
transfers nucleic acid molecules of interest (which may be genes, cDNAs, cDNA 
libraries, or fragments thereof) from the Entry Clone to an Expression Vector, to 
create an Expression Clone. 

The sites labeled L, R, B, and P are respectively the attL, attR, attB, and
attP recombination sites for the bacteriophage λ recombination proteins that
constitute the Clonase cocktail (referred to herein variously as "Clonase" or
"GATEWAY™ LR Clonase™ Enzyme Mix" (for recombination protein mixtures
mediating attL x attR recombination reactions, as described herein) or 
"GATEWAY™ BP Clonase™ Enzyme Mix" (for recombination protein mixtures 
mediating attB x attP recombination reactions, as described herein)). The 
Recombinational Cloning reactions are equivalent to concerted, highly specific, 
cutting and ligation reactions. Viewed in this way, the recombination proteins cut 
to the left and right of the nucleic acid molecule of interest in the Entry Clone and 
ligate it into the Destination vector, creating a new Expression Clone. 

The nucleic acid molecule of interest in an Expression Clone is flanked by 
the small attB1 and attB2 sites. The orientation and reading frame of the nucleic
acid molecule of interest are maintained throughout the subcloning, because attL1
reacts only with attR1, and attL2 reacts only with attR2. Likewise, attB1 reacts
only with attP1, and attB2 reacts only with attP2. Thus, the invention also relates
to methods of controlled or directional cloning using the recombination sites of the 
invention (or portions thereof), including variants, fragments, mutants and 
derivatives thereof which may have altered or enhanced specificity. The invention 
also relates more generally to any number of recombination site partners or pairs 
(where each recombination site is specific for and interacts with its corresponding 
recombination site). Such recombination sites are preferably made by mutating or 
modifying the recombination site to provide any number of necessary specificities 
(e.g., attB1-10, attP1-10, attL1-10, attR1-10, etc.), non-limiting examples of
which are described in detail in the Examples herein. 
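
The pairing rules above (attL1 only with attR1, attL2 only with attR2) and the resulting transfer of the nucleic acid molecule of interest can be sketched as a toy model. The data structure and function names are hypothetical, and the model assumes the standard outcome of an attL x attR reaction, namely an attB-flanked Expression Clone and an attP-flanked Byproduct; it is a sketch under those assumptions, not a description of the enzymology.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Molecule:
    name: str
    left_site: str    # e.g. "attL1"
    cargo: str        # segment lying between the two sites
    right_site: str   # e.g. "attL2"
    backbone: str     # everything outside the two sites


def parse_site(site: str):
    """Split 'attL1' into its type ('attL') and specificity ('1')."""
    return site[:4], site[4:]


def recombine_sites(a: str, b: str):
    """Return the product sites for partners a x b, or raise ValueError."""
    (kind_a, spec_a), (kind_b, spec_b) = parse_site(a), parse_site(b)
    if spec_a != spec_b:
        raise ValueError(f"{a} does not react with {b}")
    if {kind_a, kind_b} == {"attL", "attR"}:
        return "attB" + spec_a, "attP" + spec_a
    if {kind_a, kind_b} == {"attB", "attP"}:
        return "attL" + spec_a, "attR" + spec_a
    raise ValueError(f"{a} does not react with {b}")


def lr_reaction(entry: Molecule, destination: Molecule):
    """Swap the cargos of an attL Entry Clone and an attR Destination Vector."""
    b_left, p_left = recombine_sites(entry.left_site, destination.left_site)
    b_right, p_right = recombine_sites(entry.right_site, destination.right_site)
    expression = Molecule("Expression Clone", b_left, entry.cargo,
                          b_right, destination.backbone)
    byproduct = Molecule("Byproduct", p_left, destination.cargo,
                         p_right, entry.backbone)
    return expression, byproduct


if __name__ == "__main__":
    entry = Molecule("Entry Clone", "attL1", "gene", "attL2", "entry backbone")
    dest = Molecule("Destination Vector", "attR1", "ccdB", "attR2",
                    "destination backbone")
    for molecule in lr_reaction(entry, dest):
        print(molecule)
```

Because each site is matched only with its same-numbered partner, the cargo cannot be inserted in the reverse orientation in this model, which is the point made in the text about maintaining orientation and reading frame.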

When an aliquot from the recombination reaction is transformed into host 
cells (e.g., E. coli) and spread on plates containing an appropriate selection agent,
e.g., an antibiotic such as ampicillin with or without methicillin, cells that take up 
the desired clone form colonies. The unreacted Destination Vector does not give 
ampicillin-resistant colonies, even though it carries the ampicillin-resistance gene, 
because it contains a toxic gene, e.g., ccdB. Thus selection for ampicillin 
resistance selects for E. coli cells that carry the desired product, which usually 
comprise >90% of the colonies on the ampicillin plate. 

To participate in the Recombinational (or "Gateway™") Cloning 
Reaction, a nucleic acid molecule of interest first may be cloned into an Entry
Vector, creating an Entry Clone. Multiple options are available for creating Entry
Clones, including: cloning of PCR sequences with terminal attB recombination 
sites into Entry Vectors; using the GATEWAY™ Cloning System recombination 
reaction; transfer of genes from libraries prepared in Gateway™ Cloning System 
vectors by recombination into Entry Vectors; and cloning of restriction enzyme- 
generated fragments and PCR fragments into Entry Vectors by standard 
recombinant DNA methods. These approaches are discussed in further detail 
herein. 

A key advantage of the GATEWAY™ Cloning System is that a nucleic acid 
molecule of interest (or even a population of nucleic acid molecules of interest) 
present as an Entry Clone can be subcloned in parallel into one or more 
Destination Vectors in a simple reaction for anywhere from about 30 seconds to
about 60 minutes (preferably about 1-60 minutes, about 1-45 minutes, about 1-30
minutes, about 2-60 minutes, about 2-45 minutes, about 2-30 minutes, about 1-2
minutes, about 30-60 minutes, about 45-60 minutes, or about 30-45 minutes).
Longer reaction times (e.g., 2-24 hours, or overnight) may increase recombination
efficiency, particularly where larger nucleic acid molecules are used, as described 
in the Examples herein. Moreover, a high percentage of the colonies obtained 
carry the desired Expression Clone. This process is illustrated schematically in 
Figure 3, which shows an advantage of the invention in which the molecule of 
interest can be moved simultaneously or separately into multiple Destination 
Vectors. In the LR Reaction, one or both of the nucleic acid molecules to be 
recombined may have any topology (e.g., linear, relaxed circular, nicked circular, 
supercoiled, etc.), although one or both are preferably linear. 

The second major pathway of the GATEWAY™ Cloning System is the 
BP Reaction (Figure 4), which may also be referred to interchangeably herein as 
the Entry Reaction or the Gateward Reaction. The BP Reaction may
recombine an Expression Clone with a Donor Plasmid (the counterpart of the 
byproduct in Figure 2). This reaction transfers the nucleic acid molecule of 
interest (which may have any of a variety of topologies, including linear, coiled, 
supercoiled, etc.) in the Expression Clone into an Entry Vector, to produce a new 
Entry Clone. Once this nucleic acid molecule of interest is cloned into an Entry
Vector, it can be transferred into new Expression Vectors, through the LR
Reaction as described above. In the BP Reaction, one or both of the nucleic acid
molecules to be recombined may have any topology (e.g., linear, relaxed circular,
nicked circular, supercoiled, etc.), although one or both are preferably linear.

A useful variation of the BP Reaction permits rapid cloning and expression 
of products of amplification (e.g., PCR) or nucleic acid synthesis. Amplification 
(e.g., PCR) products synthesized with primers containing terminal 25 bp attB sites
serve as efficient substrates for the Gateward Cloning reaction. Such amplification
products may be recombined with a Donor Vector to produce an Entry Clone (see
Figure 7). The result is an Entry Clone containing the amplification fragment.
Such Entry Clones can then be recombined with Destination Vectors, through
the LR Reaction, to yield Expression Clones of the PCR product.

Additional details of the LR Reaction are shown in Figure 5A. The 
GATEWAY™ LR Clonase™ Enzyme Mix that mediates this reaction contains 
lambda recombination proteins Int (Integrase), Xis (Excisionase), and IHF 
(Integration Host Factor). In contrast, the GATEWAY™ BP Clonase™ Enzyme 
Mix, which mediates the BP Reaction (Figure 5B), comprises Int and IHF alone. 

The recombination (att) sites of each vector comprise two distinct 
segments, donated by the parental vectors. The staggered lines dividing the two 
portions of each att site, depicted in Figures 5A and 5B, represent the seven-base
staggered cut produced by Int during the recombination reactions. This structure
is seen in greater detail in Figure 6, which displays the attB recombination
sequences of an Expression Clone, generated by recombination between the attL1
and attL2 sites of an Entry Clone and the attR1 and attR2 sites of a Destination
Vector. 

The nucleic acid molecule of interest in the Expression Clone is flanked by 
attB sites: attB1 to the left (amino terminus) and attB2 to the right (carboxy
terminus). The bases in attB1 to the left of the seven-base staggered cut produced
by Int are derived from the Destination Vector, and the bases to the right of the
staggered cut are derived from the Entry Vector (see Figure 6). Note that the 
sequence is displayed in triplets corresponding to an open reading frame. If the 
reading frame of the nucleic acid molecule of interest cloned in the Entry Vector
is in phase with the reading frame shown for attB1, amino-terminal protein fusions
can be made between the nucleic acid molecule of interest and any GATEWAY™ 
Cloning System Destination Vector encoding an amino-terminal fusion domain. 
Entry Vectors and Destination Vectors that enable cloning in all three reading 
frames are described in more detail herein, particularly in the Examples. 
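
The composition of attB1 described above (left portion from the Destination Vector, right portion from the Entry Clone) can be checked against the attR1, attL1 and attB1 sequences recited in this specification. The following Python sketch is illustrative only. It does not model the position of the seven-base staggered cut; instead it assumes that the cut lies within the longest stretch shared by attR1 and attL1, so that exchanging the flanking arms around that shared stretch gives the same product wherever the cut falls inside it. The function name `exchange_arms` is hypothetical.

```python
from difflib import SequenceMatcher

# Sequences as recited in this specification (spaces and line-break hyphens removed).
ATTL1 = ("CAA ATA ATG ATT TTA TTT TGA CTG ATA GTG ACC TGT TCG "
         "TTG CAA CAA ATT GAT AAG CAA TGC TTT TTT ATA ATG CCA ACT "
         "TTG TAC AAA AAA GCA GGC T").replace(" ", "")
ATTR1 = ("ACAAGTTTGTACAAAAAAGCTGAACGAG"
         "AAACGTAAAATGATATAAATATCAATATATTAAATTAGATTTTGCAT"
         "AAAAAACAGACTACATAATACTGTAAAACACAACATATCCAGTCA"
         "CTATG")
ATTB1 = "ACAAGTTTGTACAAAAAAGCAGGCT"


def exchange_arms(att_r, att_l):
    """Join the arm of att_r left of the shared stretch to the arm of att_l right of it.

    Returns (shared_stretch, product). The shared stretch is taken to be the
    longest common substring of the two sites (an assumption of this sketch).
    """
    matcher = SequenceMatcher(None, att_r, att_l, autojunk=False)
    m = matcher.find_longest_match(0, len(att_r), 0, len(att_l))
    shared = att_r[m.a:m.a + m.size]
    left_arm = att_r[:m.a]              # derived from the Destination Vector (attR1)
    right_arm = att_l[m.b + m.size:]    # derived from the Entry Clone (attL1)
    return shared, left_arm + shared + right_arm


shared, product = exchange_arms(ATTR1, ATTL1)
print(shared)
print(product == ATTB1)
```

If the sequences above are transcribed correctly, the product equals the 25 bp attB1 sequence, which is consistent with the description of Figure 6.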

The LR Reaction allows the transfer of a desired nucleic acid molecule of 
interest into new Expression Vectors by recombining an Entry Clone with various
Destination Vectors. To participate in the LR or Destination Reaction, however,
a nucleic acid molecule of interest preferably is first converted to an Entry Clone.
Entry Clones can be made in a number of ways, as shown in Figure 7. 

One approach is to clone the nucleic acid molecule of interest into one or 
more of the Entry Vectors, using standard recombinant DNA methods, with 
restriction enzymes and ligase. The starting DNA fragment can be generated by 
restriction enzyme digestion or as a PCR product. The fragment is cloned 
between the attL1 and attL2 recombination sites in the Entry Vector. Note that
a toxic or "death" gene (e.g., ccdB), provided to minimize background colonies 
from incompletely digested Entry Vector, must be excised and replaced by the 
nucleic acid molecule of interest. 

A second approach to making an Entry Clone (Figure 7) is to make a 
library (genomic or cDNA) in an Entry Vector, as described in detail herein. Such 
libraries may then be transferred into Destination Vectors for expression 
screening, for example in appropriate host cells such as yeast cells or mammalian 
cells. 

A third approach to making Entry Clones (Figure 7) is to use Expression 
Clones obtained from cDNA molecules or libraries prepared in Expression 
Vectors. Such cDNAs or libraries, flanked by attB sites, can be introduced into 
an Entry Vector by recombination with a Donor Vector via the BP Reaction. If
desired, an entire Expression Clone library can be transferred into the Entry 
Vector through the BP Reaction. Expression Clone cDNA libraries may also be 
constructed in a variety of prokaryotic and eukaryotic GATEWAY™-modified 
vectors (e.g., the pEXP501 Expression Vector (see Figure 48), and 2-hybrid and
attB library vectors), as described in detail herein, particularly in the Examples
below. 

A fourth, and potentially most versatile, approach to making an Entry 
Clone (Figure 7) is to introduce a sequence for a nucleic acid molecule of interest 
into an Entry Vector by amplification (e.g., PCR) fragment cloning. This method
is diagramed in Figure 8. The DNA sequence first is amplified (for example, with 
PCR) as outlined in detail below and in the Examples herein, using primers 
containing one or more bp, two or more bp, three or more bp, four or more bp, 
five or more bp, preferably six or more bp, more preferably 6-25 bp (particularly 
12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or 25) bp of the attB nucleotide 
sequences (such as, but not limited to, those depicted in Figure 9), and optionally 
one or more, two or more, three or more, four or more, and most preferably four 
or five or more additional terminal nucleotide bases which preferably are guanines. 
The PCR product then may be converted to an Entry Clone by performing a BP
Reaction, in which the attB-PCR product recombines with a Donor Vector 
containing one or more attP sites. Details of this approach and protocols for PCR 
fragment subcloning are provided in Examples 8 and 21-25. 
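
As a rough illustration of the primer design described above, the following Python sketch assembles a pair of attB-tailed primers from the attB1 and attB2 sequences recited herein, each preceded by four terminal guanines. Several points are assumptions made only for this sketch: the function name `attb_primers`, the 18-base annealing length, the placeholder template sequence, and the convention that attB2 as recited reads on the top strand downstream of the insert (so the reverse primer carries its reverse complement). Reading-frame adjustment between attB1 and the insert is not addressed here.

```python
ATTB1 = "ACAAGTTTGTACAAAAAAGCAGGCT"
ATTB2 = "ACCCAGCTTTCTTGTACAAAGTGGT"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def attb_primers(template, anneal_len=18, n_guanines=4):
    """Build (forward, reverse) primers, each written 5' to 3'.

    forward: G tail + attB1 + first `anneal_len` bases of the template
    reverse: G tail + reverse complement of attB2
             + reverse complement of the last `anneal_len` bases of the template
    """
    if len(template) < anneal_len:
        raise ValueError("template is shorter than the annealing length")
    tail = "G" * n_guanines
    forward = tail + ATTB1 + template[:anneal_len]
    reverse = (tail + reverse_complement(ATTB2)
               + reverse_complement(template[-anneal_len:]))
    return forward, reverse


# Placeholder template (hypothetical sequence, for illustration only).
template = "ATGGCTAGCAAAGGTGAAGAACTGTTCACCGGTGTTGTTCCGATCCTGGTTGAACTGGACTAA"
fwd, rev = attb_primers(template)
print(fwd)
print(rev)
```

Changing `n_guanines` to 5 gives the alternative five-guanine extension mentioned above.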

A variety of Entry Clones may be produced by these methods, providing 
a wide array of cloning options; a number of specific Entry Vectors are also 
available commercially from Life Technologies, Inc. (Rockville, MD). The 
Examples herein provide a more in-depth description of selected Entry Vectors 
and details of their cloning sites. Choosing the optimal Entry Vector for a 
particular application is discussed in Example 4. 

Entry Vectors and Destination Vectors should be constructed so that the 
amino-terminal region of a nucleic acid molecule of interest (e.g., a gene, cDNA 
library or insert, or fragment thereof) will be positioned next to the attL1 site.
Entry Vectors preferably contain the rrnB transcriptional terminator upstream of
the attL1 site. This sequence ensures that expression of cloned nucleic acid
molecules of interest is reliably "off" in E. coli, so that even toxic genes can be
successfully cloned. Thus, Entry Clones may be designed to be transcriptionally 
silent. Note also that Entry Vectors, and hence Entry Clones, may contain the 
kanamycin antibiotic resistance (kanʳ) gene to facilitate selection of host cells
containing Entry Clones after transformation. In certain applications, however,
Entry Clones may contain other selection markers, including but not limited to a
gentamycin resistance (genʳ) or tetracycline resistance (tetʳ) gene, to facilitate
selection of host cells containing Entry Clones after transformation. 

Once a nucleic acid molecule of interest has been cloned into an Entry 
Vector, it may be moved into a Destination Vector. The upper right portion of 
Figure 5A shows a schematic of a Destination Vector. The thick arrow represents
some function (often transcription or translation) that will act on the nucleic acid
molecule of interest in the clone. During the recombination reaction, the region
between the attR1 and attR2 sites, including a toxic or "death" gene (e.g., ccdB),
is replaced by the DNA segment from the Entry Clone. Selection for
recombinants that have acquired the ampicillin resistance (ampʳ) gene (carried on
the Destination Vector) and that have also lost the death gene ensures that a high 
percentage (usually >90%) of the resulting colonies will contain the correct insert. 
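
The selection logic described above can be summarized in a short Python sketch. It is a simplified model under stated assumptions: each plasmid carries a single resistance marker, the Entry Clone backbone carries kanamycin resistance (one of the options mentioned herein), the byproduct retains the Entry backbone together with the death gene, and cointegrate molecules are ignored. The class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plasmid:
    name: str
    resistance: str   # antibiotic resistance marker carried on the backbone
    has_ccdB: bool    # True if the toxic "death" gene is present


def forms_colony(plasmid, antibiotic, host_tolerates_ccdB=False):
    """A transformant grows only if it resists the antibiotic and is not killed by ccdB."""
    if plasmid.resistance != antibiotic:
        return False
    if plasmid.has_ccdB and not host_tolerates_ccdB:
        return False
    return True


# Molecules assumed to be present after an LR Reaction.
LR_MIXTURE = [
    Plasmid("Entry Clone (unreacted)", "kanamycin", False),
    Plasmid("Destination Vector (unreacted)", "ampicillin", True),
    Plasmid("Expression Clone (product)", "ampicillin", False),
    Plasmid("byproduct", "kanamycin", True),
]

survivors = [p.name for p in LR_MIXTURE if forms_colony(p, "ampicillin")]
print(survivors)  # ['Expression Clone (product)']
```

Setting `host_tolerates_ccdB=True` corresponds to a host strain that survives the death gene, in which case the unreacted Destination Vector would also give ampicillin-resistant colonies in this model.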

To move a nucleic acid molecule of interest into a Destination Vector, the 
Destination Vector is mixed with the Entry Clone comprising the desired nucleic 
acid molecule of interest, a cocktail of recombination proteins (e.g., 
GATEWAY™ LR Clonase™ Enzyme Mix) is added, the mixture is incubated 
(preferably at about 25 °C for about 60 minutes, or longer under certain 
circumstances, e.g., for transfer of large nucleic acid molecules, as described
below) and any standard host cell (including bacterial cells such as E. coli; animal
cells such as insect cells, mammalian cells, nematode cells and the like; plant cells;
and yeast cells) strain is transformed with the reaction mixture. The host cell used
will be determined by the desired selection (e.g., E. coli DB3.1, available
commercially from Life Technologies, Inc., allows survival of clones containing
the ccdB death gene, and thus can be used to select for cointegrate molecules,
i.e., molecules that are hybrids between the Entry Clone and Destination Vector).
The Examples below provide further details and protocols for use of Entry and 
Destination Vectors in transferring nucleic acid molecules of interest and 
expressing RNAs or polypeptides encoded by these nucleic acid molecules in a 
variety of host cells.

The cloning system of the invention therefore offers multiple advantages:

• Once a nucleic acid molecule of interest is cloned into the GATEWAY™
  Cloning System, it can be moved into and out of other vectors with
  complete fidelity of reading frame and orientation. That is, since the
  reactions proceed whereby attL1 on the Entry Clone recombines with
  attR1 on the Destination Vector, the directionality of the nucleic acid
  molecule of interest is maintained or may be controlled upon transfer from
  the Entry Clone into the Destination Vector. Hence, the GATEWAY™
  Cloning System provides a powerful and easy method of directional
  cloning of nucleic acid molecules of interest.

• One-step cloning or subcloning: Mix the Entry Clone and the Destination
  Vector with Clonase, incubate, and transform.

• Clone PCR products readily by in vitro recombination, by adding attB
  sites to PCR primers. Then directly transfer these Entry Clones into
  Destination Vectors. This process may also be carried out in one step (see
  Examples below).

• Powerful selections give high reliability: >90% (and often >99%) of the
  colonies contain the desired DNA in its new vector.

• One-step conversion of existing standard vectors into GATEWAY™
  Cloning System vectors.

• Ideal for large vectors or those with few cloning sites.

• Recombination sites are short (25 bp), and may be engineered to contain
  no stop codons or secondary structures.

• Reactions may be automated, for high-throughput applications (e.g., for
  diagnostic purposes or for therapeutic candidate screening).

• The reactions are economical: 0.3 µg of each DNA; no restriction
  enzymes, phosphatase, ligase, or gel purification. Reactions work well
  with miniprep DNA.

• Transfer multiple clones, and even libraries, into one or more Destination
  Vectors, in a single experiment.

• A variety of Destination Vectors may be produced, for applications
  including, but not limited to:

    • Protein expression in E. coli: native proteins; fusion proteins with GST,
      His6, thioredoxin, etc., for purification, or one or more epitope tags;
      any promoter useful in expressing proteins in E. coli may be used,
      such as ptrc, λPL, and T7 promoters.
    • Protein expression in eukaryotic cells: CMV promoter, baculovirus (with
      or without His6 tag), Semliki Forest virus, Tet regulation.
    • DNA sequencing (all lac primers), RNA probes, phagemids (both
      strands).

• A variety of Entry Vectors (for recombinational cloning entry by standard
  recombinant DNA methods) may be produced:

    • Strong transcription stop just upstream, for genes toxic to E. coli.
    • Three reading frames.
    • With or without TEV protease cleavage site.
    • Motifs for prokaryotic and/or eukaryotic translation.
    • Compatible with commercial cDNA libraries.

• Expression Clone cDNA (attB) libraries, for expression screening, including
  2-hybrid libraries and phage display libraries, may also be constructed.

Recombination Site Sequences 

In one aspect, the invention relates to nucleic acid molecules, which may or 
may not be isolated nucleic acid molecules, comprising one or more nucleotide 
sequences encoding one or more recombination sites or portions thereof. In
particular, this aspect of the invention relates to such nucleic acid molecules
comprising one or more nucleotide sequences encoding attB, attP, attL, or attR,
or portions of these recombination site sequences. The invention also relates to 
mutants, derivatives, and fragments of such nucleic acid molecules. Unless 
otherwise indicated, all nucleotide sequences that may have been determined by 
sequencing a DNA molecule herein were determined using manual or automated 
DNA sequencing, such as dideoxy sequencing, according to methods that are 
routine to one of ordinary skill in the art (Sanger, F., and Coulson, A.R., J. Mol.
Biol. 94:444-448 (1975); Sanger, F., et al., Proc. Natl. Acad. Sci. USA 74:5463-5467
(1977)). All amino acid sequences of polypeptides encoded by DNA
molecules determined herein were predicted by conceptual translation of a DNA
sequence determined as above. Therefore, as is known in the art for any DNA 
sequence determined by these approaches, any nucleotide sequence determined 
herein may contain some errors. Nucleotide sequences determined by such 
methods are typically at least about 90% identical, more typically at least about 
95% to at least about 99.9% identical to the actual nucleotide sequence of the 
sequenced DNA molecule. As is also known in the art, a single insertion or 
deletion in a determined nucleotide sequence compared to the actual sequence will 
cause a frame shift in translation of the nucleotide sequence such that the predicted 
amino acid sequence encoded by a determined nucleotide sequence will be 
completely different from the amino acid sequence actually encoded by the 
sequenced DNA molecule, beginning at the point of such an insertion or deletion. 
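
The effect of a single-base deletion on conceptual translation can be illustrated with the following Python sketch, which uses the standard genetic code. The 25-base attB2 sequence recited herein serves only as a convenient test string, and the reading frame used (starting at its first base) is an arbitrary choice for this illustration; the function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Conceptually translate a DNA string in frame 1, ignoring trailing bases."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


actual = "ACCCAGCTTTCTTGTACAAAGTGGT"
# A "determined" sequence in which the fourth base was missed during sequencing.
determined = actual[:3] + actual[4:]

print(translate(actual))      # TQLSCTKW
print(translate(determined))  # TSFLVQSG
```

The first residue, which lies before the deletion, is unchanged, while every residue from the point of the deletion onward differs between the two predicted sequences.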

Unless otherwise indicated, each "nucleotide sequence" set forth herein is 
presented as a sequence of deoxyribonucleotides (abbreviated A, G, C and T).
However, by "nucleotide sequence" of a nucleic acid molecule or polynucleotide
is intended, for a DNA molecule or polynucleotide, a sequence of
deoxyribonucleotides, and for an RNA molecule or polynucleotide, the
corresponding sequence of ribonucleotides (A, G, C and U), where each thymidine
deoxyribonucleotide (T) in the specified deoxyribonucleotide sequence is replaced 
by the ribonucleotide uridine (U). Thus, the invention relates to sequences of the 
invention in the form of DNA or RNA molecules, or hybrid DNA/RNA molecules, 
and their corresponding complementary DNA, RNA, or DNA/RNA strands. 
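
The correspondence between a DNA sequence, its RNA form, and its complementary strand can be expressed in a few lines of Python. The sketch below is illustrative only; the function names are hypothetical, and the attB1 sequence recited herein is used as the example input.

```python
_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def to_rna(dna):
    """Replace each T in a DNA sequence with U to give the corresponding RNA sequence."""
    return dna.replace("T", "U")


def reverse_complement(dna):
    """Return the complementary DNA strand, written 5' to 3'."""
    return dna.translate(_DNA_COMPLEMENT)[::-1]


ATTB1 = "ACAAGTTTGTACAAAAAAGCAGGCT"

print(to_rna(ATTB1))              # ACAAGUUUGUACAAAAAAGCAGGCU
print(reverse_complement(ATTB1))  # AGCCTGCTTTTTTGTACAAACTTGT
```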

In a first such aspect, the invention provides nucleic acid molecules comprising 
one or more nucleotide sequences encoding attB1, or mutants, fragments, variants
or derivatives thereof. Such nucleic acid molecules may comprise an attB1
nucleotide sequence having the sequence set forth in Figure 9, such as:
ACAAGTTTGTACAAAAAAGCAGGCT, or a nucleotide sequence
complementary to the nucleotide sequence set forth in Figure 9 for attB1, or
mutants, fragments, variants or derivatives thereof. As one of ordinary skill will
appreciate, however, certain mutations, insertions, or deletions of one or more
bases in the attB1 sequence contained in the nucleic acid molecules of the
invention may be made without compromising the structural and functional
integrity of these molecules; hence, nucleic acid molecules comprising such
mutations, insertions, or deletions in the attB1 sequence are encompassed within
the scope of the invention. 

In a related aspect, the invention provides nucleic acid molecules comprising 
one or more nucleotide sequences encoding attB2, or mutants, fragments, variants
or derivatives thereof. Such nucleic acid molecules may comprise an attB2
nucleotide sequence having the sequence set forth in Figure 9, such as:
ACCCAGCTTTCTTGTACAAAGTGGT, or a nucleotide sequence
complementary to the nucleotide sequence set forth in Figure 9 for attB2, or
mutants, fragments, variants or derivatives thereof. As noted above for attB1,
certain mutations, insertions, or deletions of one or more bases in the attB2
sequence contained in the nucleic acid molecules of the invention may be made
without compromising the structural and functional integrity of these molecules;
hence, nucleic acid molecules comprising such mutations, insertions, or deletions
in the attB2 sequence are encompassed within the scope of the invention.

A recombinant host cell comprising a nucleic acid molecule containing attB1
and attB2 sites (the vector pEXP501, also known as pCMVSport6; see
Figure 48), E. coli DB3.1(pCMVSport6), was deposited on February 27, 1999,
with the Collection, Agricultural Research Culture Collection (NRRL), 1815
North University Street, Peoria, Illinois 61604 USA, as Deposit No. NRRL
B-30108. The attB1 and attB2 sites within the deposited nucleic acid molecule are
contained in nucleic acid cassettes in association with one or more additional 
functional sequences as described in more detail below. 

In another related aspect, the invention provides nucleic acid molecules 
comprising one or more nucleotide sequences encoding attP1, or mutants,
fragments, variants or derivatives thereof. Such nucleic acid molecules may
comprise an attP1 nucleotide sequence having the sequence set forth in Figure 9,
such as: TACAGGTCACTAATACCATCTAAGTAGTTGATTCATAGTGA-
CTGGATATGTTGTGTTTTACAGTATTATGTAGTCTGTTTTTTAT-
GCAAAATCTAATTTAATATATTGATATTTATATCATTTTACGTT-
TCTCGTTCAGCTTTTTTGTACAAAGTTGGCATTATAAAAAAGCATTG-
CTCATCAATTTGTTGCAACGAACAGGTCACTATCAGTCAAAATAA-
AATCATTATTTG, or a nucleotide sequence complementary to the nucleotide
sequence set forth in Figure 9 for attP1, or mutants, fragments, variants or
derivatives thereof. As noted above for attB1, certain mutations, insertions, or
deletions of one or more bases in the attP1 sequence contained in the nucleic acid
molecules of the invention may be made without compromising the structural and
functional integrity of these molecules; hence, nucleic acid molecules comprising
such mutations, insertions, or deletions in the attP1 sequence are encompassed
within the scope of the invention. 

In another related aspect, the invention provides nucleic acid molecules 
comprising one or more nucleotide sequences encoding attP2, or mutants,
fragments, variants or derivatives thereof. Such nucleic acid molecules may
comprise an attP2 nucleotide sequence having the sequence set forth in Figure 9,
such as: CAAATAATGATTTTATTTTGACTGATAGTGACCTGTTCGTTG- 
CAACAAATTGATAAGCAATGCTTTCTTATAATGCCAACTTT- 
GTACAAGAAAGCTGAACGAGAAACGTAAAATGATA- 
TAAATATCAATATATTAAATTAGATTTTGCATAAAAAACAG- 
ACTACATAATACTGTAAAACACAACATATCCAGTCACTATGAATCAA- 
CTACTTAGATGGTATTAGTGACCTGTA, or a nucleotide sequence 
complementary to the nucleotide sequence set forth in Figure 9 for attP2, or
mutants, fragments, variants or derivatives thereof. As noted above for attB1,
certain mutations, insertions, or deletions of one or more bases in the attP2
sequence contained in the nucleic acid molecules of the invention may be made
without compromising the structural and functional integrity of these molecules;
hence, nucleic acid molecules comprising such mutations, insertions, or deletions
in the attP2 sequence are encompassed within the scope of the invention.

A recombinant host cell comprising a nucleic acid molecule (the attP vector
pDONR201, also known as pENTR21-attPkan or pAttPkan; see Figure 49)
containing attP1 and attP2 sites, E. coli DB3.1(pAttPkan) (also called E. coli
DB3.1(pAHKan)), was deposited on February 27, 1999, with the Collection,
Agricultural Research Culture Collection (NRRL), 1815 North University Street,
Peoria, Illinois 61604 USA, as Deposit No. NRRL B-30099. The attP1 and attP2
sites within the deposited nucleic acid molecule are contained in nucleic acid
cassettes in association with one or more additional functional sequences as
described in more detail below. 

In another related aspect, the invention provides nucleic acid molecules 
comprising one or more nucleotide sequences encoding attR1, or mutants,
fragments, variants or derivatives thereof. Such nucleic acid molecules may
comprise an attR1 nucleotide sequence having the sequence set forth in Figure 9,
such as: ACAAGTTTGTACAAAAAAGCTGAACGAG- 
AAACGTAAAATGATATAAATATCAATATATTAAATTAGATTTTGCAT- 
AAAAAACAGACTACATAATACTGTAAAACACAACATATCCAGTCA- 
CTATG, or a nucleotide sequence complementary to the nucleotide sequence set 
forth in Figure 9 for attR1, or mutants, fragments, variants or derivatives thereof.
As noted above for attB1, certain mutations, insertions, or deletions of one or
more bases in the attR1 sequence contained in the nucleic acid molecules of the
invention may be made without compromising the structural and functional
integrity of these molecules; hence, nucleic acid molecules comprising such
mutations, insertions, or deletions in the attR1 sequence are encompassed within
the scope of the invention. 

In another related aspect, the invention provides nucleic acid molecules 
comprising one or more nucleotide sequences encoding attR2, or mutants,
fragments, variants or derivatives thereof. Such nucleic acid molecules may
comprise an attR2 nucleotide sequence having the sequence set forth in Figure 9,
such as: GCAGGTCGACCATAGTGACTGGATAT- 
GTTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTA- 
ATTTAATATATTGATATTTATATCATTTTACGTTTCTCGTTCAGCTT- 
TCTTGTACAAAGTGGT, or a nucleotide sequence complementary to the 
nucleotide sequence set forth in Figure 9 for attR2, or mutants, fragments, variants
or derivatives thereof. As noted above for attB1, certain mutations, insertions, or
deletions of one or more bases in the attR2 sequence contained in the nucleic acid
molecules of the invention may be made without compromising the structural and
functional integrity of these molecules; hence, nucleic acid molecules comprising
such mutations, insertions, or deletions in the attR2 sequence are encompassed
within the scope of the invention.

Recombinant host cell strains containing attR1 sites apposed to cloning sites
in reading frame A, reading frame B, and reading frame C, E. coli
DB3.1(pEZC15101) (reading frame A; see Figure 64A), E. coli
DB3.1(pEZC15102) (reading frame B; see Figure 64B), and E. coli 
DB3.1(pEZC15103) (reading frame C; see Figure 64C), and containing 
corresponding attR2 sites, were deposited on February 27, 1999, with the 
Collection, Agricultural Research Culture Collection (NRRL), 1815 North 
University Street, Peoria, Illinois 61604 USA, as Deposit Nos. NRRL B-30103, 
NRRL B-30104, and NRRL B-30105, respectively. The attR1 and attR2 sites
within the deposited nucleic acid molecules are contained in nucleic acid cassettes 
in association with one or more additional functional sequences as described in 
more detail below. 

In another related aspect, the invention provides nucleic acid molecules 
comprising one or more nucleotide sequences encoding attL1, or mutants,
fragments, variants and derivatives thereof. Such nucleic acid molecules may
comprise an attL1 nucleotide sequence having the sequence set forth in Figure 9,
such as: CAA ATA ATG ATT TTA TTT TGA CTG ATA GTG ACC TGT TCG 
TTG CAA CAA ATT GAT AAG CAA TGC TTT TTT ATA ATG CCA ACT 
TTG TAC AAA AAA GCA GGC T, or a nucleotide sequence complementary to 
the nucleotide sequence set forth in Figure 9 for attL1, or mutants, fragments,
variants or derivatives thereof. As noted above for attB1, certain mutations,
insertions, or deletions of one or more bases in the attL1 sequence contained in
the nucleic acid molecules of the invention may be made without compromising
the structural and functional integrity of these molecules; hence, nucleic acid
molecules comprising such mutations, insertions, or deletions in the attL1
sequence are encompassed within the scope of the invention. 

In another related aspect, the invention provides nucleic acid molecules 
comprising one or more nucleotide sequences encoding attL2, or mutants,
fragments, variants and derivatives thereof. Such nucleic acid molecules may
comprise an attL2 nucleotide sequence having the sequence set forth in Figure 9,
such as: C AAA TAA TGA TTT TAT TTT GAC TGA TAG TGA CCT GTT 
CGT TGC AAC AAA TTG ATA AGC AAT GCT TTC TTA TAA TGC CAA
CTT TGT ACA AGA AAG CTG GGT, or a nucleotide sequence complementary
to the nucleotide sequence set forth in Figure 9 for attL2, or mutants, fragments,
variants or derivatives thereof. As noted above for attB1, certain mutations,
insertions, or deletions of one or more bases in the attL2 sequence contained in
the nucleic acid molecules of the invention may be made without compromising
the structural and functional integrity of these molecules; hence, nucleic acid
molecules comprising such mutations, insertions, or deletions in the attL2
sequence are encompassed within the scope of the invention. 

Recombinant host cell strains containing attL1 sites apposed to cloning sites
in reading frame A, reading frame B, and reading frame C, E. coli
DB3.1(pENTR1A) (reading frame A; see Figure 10), E. coli DB3.1(pENTR2B)
(reading frame B; see Figure 11), and E. coli DB3.1(pENTR3C) (reading frame C;
see Figure 12), and containing corresponding attL2 sites, were deposited on
February 27, 1999, with the Collection, Agricultural Research Culture Collection
(NRRL), 1815 North University Street, Peoria, Illinois 61604 USA, as Deposit
Nos. NRRL B-30100, NRRL B-30101, and NRRL B-30102, respectively. The
attL1 and attL2 sites within the deposited nucleic acid molecules are contained in
nucleic acid cassettes in association with one or more additional functional 
sequences as described in more detail below. 

Each of the recombination site sequences described herein or portions thereof, 
or the nucleotide sequence cassettes contained in the deposited clones, may be 
cloned or inserted into a vector of interest (for example, using the recombinational 
cloning methods described herein and/or standard restriction cloning techniques 
that are routine in the art) to generate, for example, Entry Vectors or Destination 
Vectors which may be used to transfer a desired segment of a nucleic acid 
molecule of interest (e.g., a gene, cDNA molecule, or cDNA library) into a desired
vector or into a host cell. 

Using the information provided herein, such as the nucleotide sequences for 
the recombination site sequences described herein, an isolated nucleic acid 
molecule of the present invention encoding one or more recombination sites or 
portions thereof may be obtained using standard cloning and screening procedures, 
such as those for cloning cDNAs using mRNA as starting material. Preferred such
methods include PCR-based cloning methods, such as reverse transcriptase-PCR
(RT-PCR) using primers such as those described herein and in the Examples 
below. Alternatively, vectors comprising the cassettes containing the 
recombination site sequences described herein are available commercially from 
Life Technologies, Inc. (Rockville, MD). 

The invention is also directed to nucleic acid molecules comprising one or 
more of the recombination site sequences or portions thereof and one or more 
additional nucleotide sequences, which may encode functional or structural sites 
such as one or more multiple cloning sites, one or more transcription termination 
sites, one or more transcriptional regulatory sequences (which may be promoters, 
enhancers, repressors, and the like), one or more translational signals (e.g.,
secretion signal sequences), one or more origins of replication, one or more fusion
partner peptides (particularly glutathione S-transferase (GST), hexahistidine
(His6), and thioredoxin (Trx)), one or more selection markers or modules, one or
more nucleotide sequences encoding localization signals such as nuclear 
localization signals or secretion signals, one or more origins of replication, one or 
more protease cleavage sites, one or more genes or portions of genes encoding a 
protein or polypeptide of interest, and one or more 5' polynucleotide extensions 
(particularly an extension of guanine residues ranging in length from about 1 to 
about 20, from about 2 to about 15, from about 3 to about 10, from about 4 to 
about 1 0, and most preferably an extension of 4 or 5 guanine residues at the 5' end 
of the recombination site nucleotide sequence. The one or more additional 
functional or structural sequences may or may not flank one or more of the 
recombination site sequences contained on the nucleic acid molecules of the 
invention. 

In some nucleic acid molecules of the invention, the one or more nucleotide 
sequences encoding one or more additional functional or structural sites may be 
operably linked to the nucleotide sequence encoding the recombination site. For 
example, certain nucleic acid molecules of the invention may have a promoter 
sequence (such as a T7 promoter, a phage lambda PL promoter, an E. coli lac, trp 
or tac promoter, or another suitable promoter familiar to the skilled artisan) 
operably linked to a nucleotide sequence encoding a recombination site or portion 
thereof of the invention. 

Nucleic acid molecules of the present invention, which may be isolated nucleic 
acid molecules, may be in the form of RNA, such as mRNA, or in the form of 
DNA, including, for instance, cDNA and genomic DNA obtained by cloning or 
produced synthetically, or in the form of DNA-RNA hybrids. The nucleic acid 
molecules of the invention may be double-stranded or single-stranded. 
Single-stranded DNA or RNA may be the coding strand, also known as the sense 
strand, or it may be the non-coding strand, also referred to as the anti-sense 
strand. The nucleic acid molecules of the invention may also have a number of 
topologies, including linear, circular, coiled, or supercoiled. 

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, 
DNA or RNA, which has been removed from its native environment. For 
example, recombinant DNA molecules contained in a vector are considered 
isolated for the purposes of the present invention. Further examples of isolated 
DNA molecules include recombinant DNA molecules maintained in heterologous 
host cells, and those DNA molecules purified (partially or substantially) from a 
solution whether produced by recombinant DNA or synthetic chemistry 
techniques. Isolated RNA molecules include in vivo or in vitro RNA transcripts 
of the DNA molecules of the present invention. 

The present invention further relates to mutants, fragments, variants and 
derivatives of the nucleic acid molecules of the present invention, which encode 
portions, analogs or derivatives of one or more recombination sites. Variants may 
occur naturally, such as a natural allelic variant. By an "allelic variant" is intended 
one of several alternate forms of a gene occupying a given locus on a chromosome 
of an organism (see Lewin, B., ed., Genes II, John Wiley & Sons, New York 
(1985)). Non-naturally occurring variants may be produced using art-known 
mutagenesis techniques, such as those described hereinbelow. 

Such variants include those produced by nucleotide substitutions, deletions or 
additions or portions thereof, or combinations thereof. The substitutions, 
deletions or additions may involve one or more nucleotides. The variants may be 
altered in coding regions, non-coding regions, or both. Alterations in the coding 
regions may produce conservative or non-conservative amino acid substitutions, 
deletions or additions. Especially preferred among these are silent substitutions, 
additions and deletions, which do not alter the properties and activities of the 
encoded polypeptide(s) or portions thereof, and which also do not substantially 
alter the reactivities of the recombination site nucleic acid sequences in 
recombination reactions. Also especially preferred in this regard are conservative 
substitutions. 

Particularly preferred mutants, fragments, variants, and derivatives of the 
nucleic acid molecules of the invention include, but are not limited to, insertions, 
deletions or substitutions of one or more nucleotide bases within the 15 bp core 
region (GCTTTTTTATACTAA) which is identical in all four wildtype lambda 
att sites, attB, attP, attL and attR (see U.S. Application Nos. 08/663,002, filed 
June 7, 1996 (now U.S. Patent No. 5,888,732), 09/005,476, filed January 12, 
1998, and 09/177,387, filed October 23, 1998, which describe the core region in 
further detail, and the disclosures of which are incorporated herein by reference 
in their entireties). Analogously, the core regions in attB1, attP1, attL1 and attR1 
are identical to one another, as are the core regions in attB2, attP2, attL2 and 
attR2. Particularly preferred in this regard are nucleic acid molecules comprising 
insertions, deletions or substitutions of one or more nucleotides within the seven 
bp overlap region (TTTATAC, which is defined by the cut sites for the integrase 
protein and is the region where strand exchange takes place) that occurs within 
this 15 bp core region (GCTTTTTTATACTAA). Examples of such preferred 
mutants, fragments, variants and derivatives according to this aspect of the 
invention include, but are not limited to, nucleic acid molecules in which the 
thymine at position 1 of the seven bp overlap region has been deleted or 
substituted with a guanine, cytosine, or adenine; in which the thymine at position 
2 of the seven bp overlap region has been deleted or substituted with a guanine, 
cytosine, or adenine; in which the thymine at position 3 of the seven bp overlap 
region has been deleted or substituted with a guanine, cytosine, or adenine; in 
which the adenine at position 4 of the seven bp overlap region has been deleted 
or substituted with a guanine, cytosine, or thymine; in which the thymine at 
position 5 of the seven bp overlap region has been deleted or substituted with a 
guanine, cytosine, or adenine; in which the adenine at position 6 of the seven bp 
overlap region has been deleted or substituted with a guanine, cytosine, or 
thymine; and in which the cytosine at position 7 of the seven bp overlap region has 
been deleted or substituted with a guanine, thymine, or adenine; or any 
combination of one or more such deletions and/or substitutions within this seven 
bp overlap region. As described in detail in Example 21 herein, mutants of the 
nucleic acid molecules of the invention in which substitutions have been made 
within the first three positions of the seven bp overlap (TTT in TTTATAC) have 
been found in the present invention to strongly affect the specificity of 
recombination; mutant nucleic acid molecules in which substitutions have been 
made in the last four positions (ATAC in TTTATAC) only partially alter 
recombination specificity; and mutant nucleic acid molecules comprising 
nucleotide substitutions outside of the seven bp overlap, but elsewhere within the 
15 bp core region, do not affect specificity of recombination but do influence the 
efficiency of recombination. 

Hence, in an additional aspect, the present invention is also directed to nucleic 
acid molecules comprising one or more recombination site nucleotide sequences 
that affect recombination specificity, particularly one or more nucleotide 
sequences that may correspond substantially to the seven base pair overlap within 
the 15 bp core region, having one or more mutations that affect recombination 
specificity. Particularly preferred such molecules may comprise a consensus 
sequence (described in detail in Example 21 herein) such as NNNATAC, wherein 
"N" refers to any nucleotide (i.e., may be A, G, T/U or C), with the proviso that 
if one of the first three nucleotides in the consensus sequence is a T/U, then at 
least one of the other two of the first three nucleotides is not a T/U. 
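
For illustration, the consensus and its proviso may be expressed as a simple test. The sketch below, in Python, assumes one reading of the proviso, namely that the first three positions may be any nucleotides provided that they are not all T/U (i.e., the wildtype TTT is excluded); the function name satisfies_overlap_consensus is hypothetical. It is offered as an aid to understanding and not as a definition of the sequences of the invention.

```python
def satisfies_overlap_consensus(seq):
    """Return True if seq matches NNNATAC and, under the reading assumed
    here, its first three bases are not all T (U is treated as T)."""
    seq = seq.upper().replace("U", "T")
    if len(seq) != 7 or any(base not in "ACGT" for base in seq):
        return False
    if seq[3:] != "ATAC":
        # The last four positions are fixed in the consensus.
        return False
    # Proviso: if any of the first three is T, at least one other is not T,
    # which excludes only the case in which all three are T.
    return seq[:3] != "TTT"
```

Under this reading, GTTATAC and TTGATAC satisfy the consensus, whereas the wildtype overlap TTTATAC does not.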

In a related aspect, the present invention is also directed to nucleic acid 
molecules comprising one or more recombination site nucleotide sequences that 
enhance recombination efficiency, particularly one or more nucleotide sequences 
that may correspond substantially to the core region and having one or more 
mutations that enhance recombination efficiency. By sequences or mutations that 
"enhance recombination efficiency" is meant a sequence or mutation in a 
recombination site, preferably in the core region (e.g., the 15 bp core region of att 
recombination sites), that results in an increase in cloning efficiency (typically 
measured by determining successful cloning of a test sequence, e.g., by 
determining CFU/ml for a given cloning mixture) when recombining molecules 
comprising the mutated sequence or core region as compared to molecules that 
do not comprise the mutated sequence or core region (e.g., those comprising a 
wildtype recombination site core region sequence). More specifically, whether or 
not a given sequence or mutation enhances recombination efficiency may be 
determined using the sequence or mutation in recombinational cloning as 
described herein, and determining whether the sequence or mutation provides 
enhanced recombinational cloning efficiency when compared to a non-mutated 
(e.g., wildtype) sequence. Methods of determining preferred cloning efficiency- 
enhancing mutations for a number of recombination sites, particularly for att 
recombination sites, are described herein, for example in Examples 22-25. 
Examples of preferred such mutant recombination sites include but are not limited 
to the attL consensus core sequence of caacttnntnnnannaagttg (wherein "n" 
represents any nucleotide), for example the attL5 sequence 
agcctgctttattatactaagttggcatta and the attL6 sequence 
agcctgcttttttatattaagttggcatta; the attB1.6 sequence 
ggggacaactttgtacaaaaaagttggct; the attB2.2 sequence 
ggggacaactttgtacaagaaagctgggt; and the attB2.10 sequence 
ggggacaactttgtacaagaaagttgggt. Those of skill in the art will appreciate that, in 
addition to the core region, other portions of the att site may affect the efficiency 
of recombination. There are five so-called arm binding sites for the integrase 
protein in the bacteriophage lambda attP site, two in attR (P1 and P2), and three 
in attL (P'1, P'2 and P'3). Compared to the core binding sites, the integrase 
protein binds to arm sites with high affinity and interacts with core and arm sites 
through two different domains of the protein. As with the core binding site, a 
consensus sequence for the arm binding site consisting of C/AAGTCACTAT has 
been inferred from sequence comparison of the five arm binding sites and seven 
non-att sites (Ross and Landy, Proc. Natl. Acad. Sci. USA 79:7724-7728 (1982)). 
Each arm site has been mutated and tested for its effect in the excision and 
integration reactions (Numrych et al., Nucl. Acids Res. 18:3953 (1990)). Hence, 
specific sites are utilized in each reaction in different ways, namely, the P1 and P'3 
sites are essential for the integration reaction whereas the other three sites are 
dispensable to the integration reaction to varying degrees. Similarly, the P2, P'1 
and P'2 sites are most important for the excision reaction, whereas P1 and P'3 are 
completely dispensable. Interestingly, when P2 is mutated the integration reaction 
occurs more efficiently than with the wild type attP site. Similarly, when P1 and 
P'3 are mutated the excision reaction occurs more efficiently. The stimulatory 
effect of mutating integrase arm binding sites can be explained by the removal of 
sites that compete with or inhibit a specific recombination pathway or that 
function in a reaction that converts products back to starting substrates. In fact 
there is evidence for an Xis-independent LR reaction (Abremski and Gottesman, 
J. Mol. Biol. 153:67-78 (1981)). Thus, in addition to modifications in the core 
region of the att site, the present invention contemplates the use of att sites 
containing one or more modifications in the integrase arm-type binding sites. In 
some preferred embodiments, one or more mutations may be introduced into one 
or more of the P1, P'1, P2, P'2 and P'3 sites. In some preferred embodiments, 
multiple mutations 
may be introduced into one or more of these sites. Preferred such mutations 
include those which increase the recombination in vitro. For example, in some 
embodiments mutations may be introduced into the arm-type binding sites such 
that integrative recombination, corresponding to the BP reaction, is enhanced. In 
other embodiments, mutations may be introduced into the arm-type binding sites 
such that excisive recombination, corresponding to the LR reaction, is enhanced. 
Of course, based on the guidance contained herein, particularly in the construction 
and evaluation of effects of mutated recombination sites upon recombinational 
specificity and efficiency, analogous mutated or engineered sequences may be 
produced for other recombination sites described herein (including but not limited 
to lox, FRT, and the like) and used in accordance with the invention. For 
example, much like the mutagenesis strategy used to select core binding sites that 
enhance recombination efficiency, similar strategies can be employed to select 
changes in the arms of attP, attL and attR, and in analogous sequences in other 
recombination sites such as lox, FRT and the like, that enhance recombination 
efficiency. Hence, the construction and evaluation of such mutants is well within 
the abilities of those of ordinary skill in the art without undue experimentation. 
One suitable methodology for preparing and evaluating such mutations is found 
in Numrych et al., (1990) Nucleic Acids Research 18(13): 3953-3959. 
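
The comparison of cloning efficiencies referred to above, in which the CFU/ml obtained with a mutated recombination site is compared with that obtained with a non-mutated (e.g., wildtype) site, amounts to a simple ratio. The following Python sketch illustrates one way such a comparison might be recorded; the function names and the threshold of 1.0 are assumptions made for illustration, and any colony counts supplied to it would be those of the user's own experiments.

```python
def fold_enhancement(cfu_per_ml_mutant, cfu_per_ml_reference):
    """Ratio of cloning efficiency (CFU/ml) obtained with a mutated site
    to that obtained with the reference (e.g., wildtype) site."""
    if cfu_per_ml_reference <= 0:
        raise ValueError("reference CFU/ml must be positive")
    return cfu_per_ml_mutant / cfu_per_ml_reference


def enhances_efficiency(cfu_per_ml_mutant, cfu_per_ml_reference, threshold=1.0):
    """True if the mutated site yields more colonies per ml than the
    reference by more than the chosen (hypothetical) threshold."""
    return fold_enhancement(cfu_per_ml_mutant, cfu_per_ml_reference) > threshold
```

In practice the two reactions would be carried out under otherwise identical conditions, and replicate measurements would be needed before any difference is regarded as meaningful.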

Other mutant sequences and nucleic acid molecules that may be suitable to 
enhance recombination efficiency will be apparent from the description herein, or 
may be easily determined by one of ordinary skill using only routine 
experimentation in molecular biology in view of the description herein and 
information that is readily available in the art. 

Since the genetic code is well known in the art, it is also routine for one of 
ordinary skill in the art to produce degenerate variants of the nucleic acid 
molecules described herein without undue experimentation. Hence, nucleic acid 
molecules comprising degenerate variants of nucleic acid sequences encoding the 
recombination sites described herein are also encompassed within the scope of the 
invention. 

Further embodiments of the invention include isolated nucleic acid molecules 
comprising a polynucleotide having a nucleotide sequence at least 50% identical, 
at least 60% identical, at least 70% identical, at least 75% identical, at least 80% 
identical, at least 85% identical, at least 90% identical, and more preferably at least 
95%, 96%, 97%, 98% or 99% identical to the nucleotide sequences of the 
seven bp overlap region within the 15 bp core region of the recombination sites 
described herein, or the nucleotide sequences of attB1, attB2, attP1, attP2, attL1, 
attL2, attR1 or attR2 as set forth in Figure 9 (or portions thereof), or a nucleotide 
sequence complementary to any of these nucleotide sequences, or fragments, 
variants, mutants, and derivatives thereof. 

By a polynucleotide having a nucleotide sequence at least, for example, 95% 
"identical" to a reference nucleotide sequence encoding a particular recombination 
site or portion thereof is intended that the nucleotide sequence of the 
polynucleotide is identical to the reference sequence except that the polynucleotide 
sequence may include up to five point mutations (e.g., insertions, substitutions, or 
deletions) per each 100 nucleotides of the reference nucleotide sequence encoding 
the recombination site. For example, to obtain a polynucleotide having a 
nucleotide sequence at least 95% identical to a reference attB1 nucleotide 
sequence, up to 5% of the nucleotides in the attB1 reference sequence may be 
deleted or substituted with another nucleotide, or a number of nucleotides up to 
5% of the total nucleotides in the attB1 reference sequence may be inserted into 
the attB1 reference sequence. These mutations of the reference sequence may 
occur at the 5' or 3' terminal positions of the reference nucleotide sequence or 
anywhere between those terminal positions, interspersed either individually among 
nucleotides in the reference sequence or in one or more contiguous groups within 
the reference sequence. 
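
As a worked illustration of the foregoing definition, the largest number of point mutations compatible with a stated minimum percent identity may be computed from the length of the reference sequence. The Python sketch below is illustrative only; the function name max_point_mutations is hypothetical, and the 25-base length used in the example is that of the attB1 sequence ACAAGTTTGTACAAAAAAGCAGGCT recited among the primer sequences elsewhere herein.

```python
def max_point_mutations(reference_length, percent_identity):
    """Largest whole number of point mutations (insertions, substitutions
    or deletions) that keeps a sequence at least percent_identity percent
    identical to a reference of the given length."""
    if not 0 <= percent_identity <= 100:
        raise ValueError("percent_identity must be between 0 and 100")
    # Integer arithmetic rounds down, so the identity threshold is never
    # undershot.
    return reference_length * (100 - percent_identity) // 100


ATTB1 = "ACAAGTTTGTACAAAAAAGCAGGCT"  # 25 bases
print(max_point_mutations(100, 95))         # five mutations per 100 nucleotides
print(max_point_mutations(len(ATTB1), 95))  # one mutation in a 25-base reference
```

Thus, for a 25-base reference, a single point mutation gives 96% identity, whereas two point mutations give 92% identity and fall below the 95% level.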

As a practical matter, whether any particular nucleic acid molecule is at least 
50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical 
to, for instance, a given recombination site nucleotide sequence or portion thereof 
can be determined conventionally using known computer programs such as 
DNAsis software (Hitachi Software, San Bruno, California) for initial sequence 
alignment followed by ESEE version 3.0 DNA/protein sequence software 
(cabot@trog.mbb.sfu.ca) for multiple sequence alignments. Alternatively, such 
determinations may be accomplished using the BESTFIT program (Wisconsin 
Sequence Analysis Package, Genetics Computer Group, University Research Park, 
575 Science Drive, Madison, WI 53711), which employs a local homology 
algorithm (Smith and Waterman, Advances in Applied Mathematics 2: 482-489 
(1981)) to find the best segment of homology between two sequences. When 
using DNAsis, ESEE, BESTFIT or any other sequence alignment program to 
determine whether a particular sequence is, for instance, 95% identical to a 
reference sequence according to the present invention, the parameters are set such 
that the percentage of identity is calculated over the full length of the reference 
nucleotide sequence and that gaps in homology of up to 5% of the total number 
of nucleotides in the reference sequence are allowed. 
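
For illustration, once two sequences have been aligned by any suitable program, percent identity calculated over the full length of the reference sequence may be obtained as sketched below in Python. The sketch is not a reimplementation of DNAsis, ESEE or BESTFIT; it assumes that a gapped pairwise alignment has already been produced, that gaps are written as "-", and that every mismatched or gapped column counts as one point mutation. The function name percent_identity is hypothetical.

```python
def percent_identity(aligned_reference, aligned_query):
    """Percent identity over the full length of the reference, given two
    equal-length gapped strings taken from a pairwise alignment."""
    if len(aligned_reference) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")
    reference_length = sum(1 for base in aligned_reference if base != "-")
    if reference_length == 0:
        raise ValueError("reference contains no nucleotides")
    # Each column that differs (substitution, deletion or insertion)
    # is counted as one point mutation.
    differences = sum(
        1 for r, q in zip(aligned_reference.upper(), aligned_query.upper()) if r != q
    )
    return max(0.0, 100.0 * (reference_length - differences) / reference_length)
```

For example, a 25-base reference aligned with a query differing at a single position gives 96% identity under this calculation.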

The present invention is directed to nucleic acid molecules at least 50%, 60%, 
70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to the attB1, 
attB2, attP1, attP2, attL1, attL2, attR1 or attR2 nucleotide sequences as set forth 
in Figure 9, or to the nucleotide sequence of the deposited clones, irrespective of 
whether they encode particular functional polypeptides. This is because even 
where a particular nucleic acid molecule does not encode a particular functional 
polypeptide, one of skill in the art would still know how to use the nucleic acid 
molecule, for instance, as a hybridization probe or a polymerase chain reaction 
(PCR) primer. 

Mutations can also be introduced into the recombination site nucleotide 
sequences for enhancing site-specific recombination or altering the specificities of 
the reactants, etc. Such mutations include, but are not limited to: recombination 
sites without translation stop codons that allow fusion proteins to be encoded; 
recombination sites recognized by the same proteins but differing in base sequence 
such that they react largely or exclusively with their homologous partners, allowing 
multiple reactions to be contemplated; and mutations that prevent hairpin 
formation of recombination sites. Which particular reactions take place can be 
specified by which particular partners are present in the reaction mixture. 

There are well known procedures for introducing specific mutations into 
nucleic acid sequences. A number of these are described in Ausubel, F.M. et al., 
Current Protocols in Molecular Biology, Wiley Interscience, New York (1989- 
1996). Mutations can be designed into oligonucleotides, which can be used to 
modify existing cloned sequences, or in amplification reactions. Random 
mutagenesis can also be employed if appropriate selection methods are available 
to isolate the desired mutant DNA or RNA. The presence of the desired 
mutations can be confirmed by sequencing the nucleic acid by well known 
methods. 

The following non-limiting methods can be used to modify or mutate a given 
nucleic acid molecule encoding a particular recombination site to provide mutated 
sites that can be used in the present invention: 

1. By recombination of two parental DNA sequences by site-specific (e.g., attL 
   and attR to give attP) or other (e.g., homologous) recombination 
   mechanisms where the parental DNA segments contain one or more base 
   alterations resulting in the final mutated nucleic acid molecule; 

2. By mutation or mutagenesis (site-specific, PCR, random, spontaneous, etc.) 
   directly of the desired nucleic acid molecule; 

3. By mutagenesis (site-specific, PCR, random, spontaneous, etc.) of parental 
   DNA sequences, which are recombined to generate a desired nucleic acid 
   molecule; 

4. By reverse transcription of an RNA encoding the desired core sequence; 
   and 

5. By de novo synthesis (chemical synthesis) of a sequence having the desired 
   base changes, or random base changes followed by sequencing or 
   functional analysis according to methods that are routine in the art. 

The functionality of the mutant recombination sites can be demonstrated in 
ways that depend on the particular characteristic that is desired. For example, the 
lack of translation stop codons in a recombination site can be demonstrated by 
expressing the appropriate fusion proteins. Specificity of recombination between 
homologous partners can be demonstrated by introducing the appropriate 
molecules into in vitro reactions, and assaying for recombination products as 
described herein or known in the art. Other desired mutations in recombination 
sites might include the presence or absence of restriction sites, translation or 
transcription start signals, protein binding sites, particular coding sequences, and 
other known functionalities of nucleic acid base sequences. Genetic selection 
schemes for particular functional attributes in the recombination sites can be used 
according to known method steps. For example, the modification of sites to 
provide (from a pair of sites that do not interact) partners that do interact could 
be achieved by requiring deletion, via recombination between the sites, of a DNA 
sequence encoding a toxic substance. Similarly, selection for sites that remove 
translation stop sequences, the presence or absence of protein binding sites, etc., 
can be easily devised by those skilled in the art. 

Accordingly, the present invention also provides a nucleic acid molecule, 
comprising at least one DNA segment having at least one, and preferably at least 
two, engineered recombination site nucleotide sequences of the invention flanking 
a selectable marker and/or a desired DNA segment, wherein at least one of said 
recombination site nucleotide sequences has at least one engineered mutation that 
enhances recombination in vitro in the formation of a Cointegrate DNA or a 
Product DNA. Such engineered mutations may be in the core sequence of the 
recombination site nucleotide sequence of the invention; see U.S. Application Nos. 
08/486,139, filed June 7, 1995, 08/663,002, filed June 7, 1996 (now U.S. Patent 
No. 5,888,732), 09/005,476, filed January 12, 1998, and 09/177,387, filed 
October 23, 1998, the disclosures of which are all incorporated herein by reference 
in their entireties. 

While in the preferred embodiment the recombination sites differ in sequence 
and do not interact with each other, it is recognized that sites comprising the same 
sequence, which may interact with each other, can be manipulated or engineered 
to inhibit recombination with each other. Such conceptions are considered and 
incorporated herein. For example, a protein binding site (e.g., an antibody-binding 
site, a histone-binding site, an enzyme-binding site, or a binding site for any nucleic 
acid molecule-binding protein) can be engineered adjacent to one of the sites. In 
the presence of the protein that recognizes the engineered site, the recombinase 
fails to access the site and another recombination site in the nucleic acid molecule 
is therefore used preferentially. In the cointegrate this site can no longer react 
since it has been changed, e.g., from attB to attL. During or upon resolution of 
the cointegrate, the protein can be inactivated (e.g., by antibody, heat or a change 
of buffer) and the second site can undergo recombination. 

The nucleic acid molecules of the invention can have at least one mutation 
that confers at least one enhancement of said recombination, said enhancement 
selected from the group consisting of substantially (i) favoring integration; (ii) 
favoring recombination; (iii) relieving the requirement for host factors; (iv) 
increasing the efficiency of said Cointegrate DNA or Product DNA formation; 
(v) increasing the specificity of said Cointegrate DNA or Product DNA 
formation; and (vi) adding or deleting protein binding sites. 

In other embodiments, the nucleic acid molecules of the invention may be PCR 
primer molecules, which comprise one or more of the recombination site 
sequences described herein or portions thereof, particularly those shown in 
Figure 9 (or sequences complementary to those shown in Figure 9), or mutants, 
fragments, variants or derivatives thereof, attached at the 3' end to a target- 
specific template sequence which specifically interacts with a target nucleic acid 
molecule which is to be amplified. Primer molecules according to this aspect of 
the invention may further comprise one or more (e.g., 1, 2, 3, 4, 5, 10, 20, 25, 50, 
100, 500, 1000, or more) additional bases at their 5' ends, and preferably comprise 
one or more (particularly four or five) additional bases, which are preferably 
guanines, at their 5' ends, to increase the efficiency with which amplification 
products incorporating the primer molecules are cloned in the recombinational 
cloning system of the invention. Such nucleic acid molecules and primers are 
described in detail in the 
examples herein, particularly in Examples 22-25. 

Certain primers of the invention may comprise one or more nucleotide 
deletions in the attB1, attB2, attP1, attP2, attL1, attL2, attR1 or attR2 sequences 
as set forth in Figure 9. In one such aspect, for example, attB2 primers may be 
constructed in which one or more of the first four nucleotides at the 5' end of the 
attB2 sequence shown in Figure 9 have been deleted. Primers according to this 
aspect of the invention may therefore have the sequence: 
(attB2(-1)): CCCAGCTTTCTTGTACAAAGTGGTnnnnnnnnnnnnn . . . n 
(attB2(-2)): CCAGCTTTCTTGTACAAAGTGGTnnnnnnnnnnnnnn . . . n 
(attB2(-3)): CAGCTTTCTTGTACAAAGTGGTnnnnnnnnnnnnnnn . . . n 
(attB2(-4)): AGCTTTCTTGTACAAAGTGGTnnnnnnnnnnnnnnnn . . . n, 
wherein "nnnnnnnnnnnnn . . . n" at the 3' end of the primer represents a target- 
specific sequence of any length, for example from one base up to all of the bases 
of a target nucleic acid molecule (e.g., a gene) or a portion thereof, the sequence 
and length of which will depend upon the identity of the target nucleic acid molecule 
which is to be amplified. 
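
The 5' deletion series shown above may be generated mechanically, as in the following illustrative Python sketch. The sketch assumes that the undeleted site is the reverse complement of the 25-base attB2 sequence ACCACTTTGTACAAGAAAGCTGGGT recited among the primer sequences elsewhere herein; this assumption is consistent with the four sequences listed above but should be checked against Figure 9. The function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def five_prime_deletions(site, max_deleted=4):
    """Map the number of deleted 5' nucleotides (1..max_deleted) to the
    truncated recombination site portion of the primer."""
    return {n: site[n:] for n in range(1, max_deleted + 1)}


# Assumed undeleted site: reverse complement of the attB2 sequence.
ATTB2_SITE = reverse_complement("ACCACTTTGTACAAGAAAGCTGGGT")

for n, truncated in five_prime_deletions(ATTB2_SITE).items():
    # A target-specific sequence would be appended at the 3' end.
    print("attB2(-%d): %s" % (n, truncated))
```

A complete primer would be obtained by appending the desired target-specific sequence to the 3' end of each truncated site.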

The primer nucleic acid molecules according to this aspect of the invention 
may be produced synthetically by attaching the recombination site sequences 
depicted in Figure 9, or portions thereof, to the 5' end of a standard PCR target- 
specific primer according to methods that are well-known in the art. Alternatively, 
additional primer nucleic acid molecules of the invention may be produced 
synthetically by adding one or more nucleotide bases, which preferably correspond 
to one or more, preferably five or more, and more preferably six or more, 
contiguous nucleotides of the att nucleotide sequences described herein (see, e.g., 
Example 20 herein; see also U.S. Application Nos. 08/663,002, filed June 7, 1996 
(now U.S. Patent No. 5,888,732), 09/005,476, filed January 12, 1998, and 
09/177,387, filed October 23, 1998, the disclosures of which are all incorporated 
herein by reference in their entireties), to the 5' end of a standard PCR target- 
specific primer according to methods that are well-known in the art, to provide 
primers having the specific nucleotide sequences described herein. As noted 
above, primer nucleic acid molecules according to this aspect of the invention may 
also optionally comprise one, two, three, four, five, or more additional nucleotide 
bases at their 5' ends, and preferably will comprise four or five guanines at their 
5' ends. In one particularly preferred such aspect, the primer nucleic acid 
molecules of the invention may comprise one or more, preferably five or more, 
more preferably six or more, still more preferably 6-18 or 6-25, and most 
preferably 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or 
25, contiguous nucleotides or bp of the attB1 or attB2 nucleotide sequences 
depicted in Figure 9 (or nucleotides complementary thereto), linked to the 5' end 
of a target-specific (e.g., a gene-specific) primer molecule. Primer nucleic acid 
molecules according to this aspect of the invention include, but are not limited to, 
attB1- and attB2-derived primer nucleic acid molecules having the following 
nucleotide sequences: 

ACAAGTTTGTACAAAAAAGCAGGCT-nnnnnnnnnnnnn . . . n 
ACCACTTTGTACAAGAAAGCTGGGT-nnnnnnnnnnnnn . . . n 
TGTACAAAAAAGCAGGCT-nnnnnnnnnnnnn . . . n 
TGTACAAGAAAGCTGGGT-nnnnnnnnnnnnn . . . n 
ACAAAAAAGCAGGCT-nnnnnnnnnnnnn . . n 
ACAAGAAAGCTGGGT-nnnnnnnnnnnnn . . . n 
AAAAAGCAGGCT-nnnnnnnnnnnnn . . . n 
AGAAAGCTGGGT-nnnnnnnnnnnnn . . . n 
AAAAGCAGGCT-nnnnnnnnnnnnn . . . n 
GAAAGCTGGGT-nnnnnnnnnnnnn . . . n 
AAAGCAGGCT-nnnnnnnnnnnnn . . . n 
AAAGCTGGGT-nnnnnnnnnnnnn . . . n 
AAGCAGGCT-nnnnnnnnnnnnn , . . n 
AAGCTGGGT-nnnnnnnnnnnnn . . . n 
AGCAGGCT-nnnnnnnnnnnnn . . . n 
AGCTGGGT-nnnnnnnnnnnnn . . . n 
GCAGGCT-nnnnnnnnnnnnn . . . n 
GCTGGGT-nnnnnnnnnnnnn . . . n 
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CAGGCT-nnnnnnnnnnnnn . . . n 
CTGGGT-nnnnnnnnnnnnn . . . n, 

wherein "nnnnnnnnnnnnn . . . n" at the 3' end of the primer represents a target- 
specific sequence of any length, for example from one base up to all of the bases 
of a target nucleic acid molecule (e.g., a gene) or a portion thereof, the sequence 
and length of which will depend upon the identity of the target nucleic acid molecule 
which is to be amplified. 
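
The attB1- and attB2-derived primers listed above each consist of the 3'-most contiguous nucleotides of the corresponding 25-base sequence, followed by a target-specific sequence, and may optionally carry additional guanines at the 5' end. The following Python sketch shows how such primers might be assembled; the function name att_primer and the target-specific sequence used in the example are hypothetical and are given for illustration only.

```python
ATTB1 = "ACAAGTTTGTACAAAAAAGCAGGCT"
ATTB2 = "ACCACTTTGTACAAGAAAGCTGGGT"


def att_primer(att_seq, n_att, target_specific, n_guanines=0):
    """Assemble a primer (5' to 3'): optional guanines, the last n_att
    contiguous nucleotides of att_seq, then the target-specific sequence."""
    if not 1 <= n_att <= len(att_seq):
        raise ValueError("n_att must be between 1 and the att sequence length")
    if n_guanines < 0:
        raise ValueError("n_guanines must not be negative")
    return "G" * n_guanines + att_seq[-n_att:] + target_specific


# Hypothetical target-specific sequence, for illustration only.
target = "ATGGCTAGCAAAGGAGAA"
print(att_primer(ATTB1, 12, target))                 # 12-base attB1 portion
print(att_primer(ATTB1, 25, target, n_guanines=4))   # full attB1 with four 5' guanines
```

The att portions of 25, 18, 15, 12, 11, 10, 9, 8, 7 and 6 nucleotides produced in this way correspond to the sequences listed above.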

Of course, it will be apparent to one of ordinary skill from the teachings 
contained herein that additional primer nucleic acid molecules analogous to those 
specifically described herein may be produced using one or more, preferably five 
or more, more preferably six or more, still more preferably ten or more, 1 5 or 
more, 20 or more, 25 or more, 30 or more, etc. (through to and including all) of 
the contiguous nucleotides or bp of the atfPl, atfP2, a#Ll , a//L2, atiRA or attR2 
nucleotide sequences depicted in Figure 9 (or nucleotides complementary thereto), 
linked to the 5' end of a target- specific (e.g. , a gene- specific) primer molecule. As 
noted above, such primer nucleic acid molecules may optionally further comprise 
one, two, three, four, five, or more additional nucleotide bases at their 5' ends, and 
preferably will comprise four guanines at their 5' ends. Other primer molecules 
comprising the attB1, attB2, attP1, attP2, attL1, attL2, attR1 and attR2 
sequences depicted in Figure 9, or portions thereof, may be made by one of 
ordinary skill without resorting to undue experimentation in accordance with the 
guidance provided herein. 

The primers of the invention described herein are useful in producing PCR 
fragments having a nucleic acid molecule of interest flanked at each end by a 
recombination site sequence (as described in detail below in Example 9), for use 
in cloning of PCR-amplified DNA fragments using the recombination system of 
the invention (as described in detail below in Examples 8, 19 and 21-25). 

Vectors 

The invention also relates to vectors comprising one or more of the nucleic 
acid molecules of the invention, as described herein. In accordance with the 
invention, any vector may be used to construct the vectors of the invention. In 
particular, vectors known in the art and those commercially available (and variants 
or derivatives thereof) may in accordance with the invention be engineered to 
include one or more nucleic acid molecules encoding one or more recombination 
sites (or portions thereof), or mutants, fragments, or derivatives thereof, for use 
in the methods of the invention. Such vectors may be obtained from, for example, 
Vector Laboratories Inc., Invitrogen, Promega, Novagen, New England Biolabs, 
Clontech, Roche, Pharmacia, EpiCenter, OriGenes Technologies Inc., Stratagene, 
Perkin Elmer, Pharmingen, Life Technologies, Inc., and Research Genetics. Such 
vectors may then for example be used for cloning or subcloning nucleic acid 
molecules of interest. General classes of vectors of particular interest include 
prokaryotic and/or eukaryotic cloning vectors, Expression Vectors, fusion vectors, 
two-hybrid or reverse two-hybrid vectors, shuttle vectors for use in different 
hosts, mutagenesis vectors, transcription vectors, vectors for receiving large 
inserts and the like. 

Other vectors of interest include viral origin vectors (M13 vectors, bacterial 
phage λ vectors, bacteriophage P1 vectors, adenovirus vectors, herpesvirus 
vectors, retrovirus vectors, phage display vectors, combinatorial library vectors), 
high, low, and adjustable copy number vectors, vectors which have compatible 
replicons for use in combination in a single host (pACYC184 and pBR322) and 
eukaryotic episomal replication vectors (pCDM8). 

Particular vectors of interest include prokaryotic Expression Vectors such as 
pcDNAII, pSL301, pSE280, pSE380, pSE420, pTrcHisA, B, and C, pRSET A, 
B, and C (Invitrogen, Inc.), pGEMEX-1, and pGEMEX-2 (Promega, Inc.), the 
pET vectors (Novagen, Inc.), pTrc99A, pKK223-3, the pGEX vectors, pEZZ18, 
pRIT2T, and pMC1871 (Pharmacia, Inc.), pKK233-2 and pKK388-1 (Clontech, 
Inc.), and pProEx-HT (Life Technologies, Inc.) and variants and derivatives 
thereof. Destination Vectors can also be made from eukaryotic Expression 
Vectors such as pFastBac, pFastBac HT, pFastBac DUAL, pSFV, and pTet- 
Splice (Life Technologies, Inc.), pEUK-C1, pPUR, pMAM, pMAMneo, pBI101, 
pBI121, pDR2, pCMVEBNA, and pYACneo (Clontech), pSVK3, pSVL, pMSG, 
pCH110, and pKK232-8 (Pharmacia, Inc.), p3'SS, pXT1, pSG5, pPbac, pMbac, 
pMC1neo, and pOG44 (Stratagene, Inc.), and pYES2, pAC360, pBlueBacHis A, 
B, and C, pVL1392, pBlueBacIII, pCDM8, pcDNA1, pZeoSV, pcDNA3, pREP4, 
pCEP4, and pEBVHis (Invitrogen, Inc.) and variants or derivatives thereof. 

Other vectors of particular interest include pUC18, pUC19, pBlueScript, 
pSPORT, cosmids, phagemids, YACs (yeast artificial chromosomes), BACs 
(bacterial artificial chromosomes), MACs (mammalian artificial chromosomes), 
pQE70, pQE60, pQE9 (Qiagen), pBS vectors, PhageScript vectors, BlueScript 
vectors, pNH8A, pNH16A, pNH18A, pNH46A (Stratagene), pcDNA3 
(Invitrogen), pGEX, pTrxfus, pTrc99A, pET-5, pET-9, pKK223-3, pKK233-3, 
pDR540, pRIT5 (Pharmacia), pSPORT1, pSPORT2, pCMVSPORT2.0 and pSV- 
SPORT1 (Life Technologies, Inc.) and variants or derivatives thereof. 

Additional vectors of interest include pTrxFus, pThioHis, pLEX, pTrcHis, 
pTrcHis2, pRSET, pBlueBacHis2, pcDNA3.1/His, pcDNA3.1(-)/Myc-His, 
pSecTag, pEBVHis, pPIC9K, pPIC3.5K, pAO815, pPICZ, pPICZα, pGAPZ, 
pGAPZα, pBlueBac4.5, pBlueBacHis2, pMelBac, pSinRep5, pSinHis, pIND, 
pIND(SP1), pVgRXR, pcDNA2.1, pYES2, pZErO1.1, pZErO-2.1, pCR-Blunt, 
pSE280, pSE380, pSE420, pVL1392, pVL1393, pCDM8, pcDNA1.1, 
pcDNA1.1/Amp, pcDNA3.1, pcDNA3.1/Zeo, pSe,SV2, pRc/CMV2, pRc/RSV, 
pREP4, pREP7, pREP8, pREP9, pREP10, pCEP4, pEBVHis, pCR3.1, pCR2.1, 
pCR3.1-Uni, and pCRBac from Invitrogen; λExCell, λgt11, pTrc99A, pKK223-3, 
pGEX-1λT, pGEX-2T, pGEX-2TK, pGEX-4T-1, pGEX-4T-2, pGEX-4T-3, 
pGEX-3X, pGEX-5X-1, pGEX-5X-2, pGEX-5X-3, pEZZ18, pRIT2T, 
pMC1871, pSVK3, pSVL, pMSG, pCH110, pKK232-8, pSL1180, pNEO, and 
pUC4K from Pharmacia; pSCREEN-1b(+), pT7Blue(R), pT7Blue-2, pCITE- 
4abc(+), pOCUS-2, pTAg, pET-32LIC, pET-30LIC, pBAC-2cpLIC, pBACgus- 
2cp LIC, pT7Blue-2 LIC, pT7Blue-2, λSCREEN-1, λBlueSTAR, pET-3abcd, 
pET-7abc, pET-9abcd, pET-11abcd, pET-12abc, pET-14b, pET-15b, pET-16b, 
pET-17b, pET-17xb, pET-19b, pET-20b(+), pET-21abcd(+), pET-22b(+), pET- 
23abcd(+), pET-24abcd(+), pET-25b(+), pET-26b(+), pET-27b(+), pET- 
28abc(+), pET-29abc(+), pET-30abc(+), pET-31b(+), pET-32abc(+), pET- 
33b(+), pBAC-1, pBACgus-1, pBAC4x-1, pBACgus4x-1, pBAC-3cp, pBACgus- 
2cp, pBACsurf-1, pIg, Signal pIg, pYX, Selecta Vecta-Neo, Selecta Vecta-Hyg, 
and Selecta Vecta-Gpt from Novagen; pLexA, pB42AD, pGBT9, pAS2-1, 
pGAD424, pACT2, pGAD GL, pGAD GH, pGAD10, pGilda, pEZM3, pEGFP, 
pEGFP-1, pEGFP-N, pEGFP-C, pEBFP, pGFPuv, pGFP, p6xHis-GFP, pSEAP2- 
Basic, pSEAP2-Control, pSEAP2-Promoter, pSEAP2-Enhancer, pβgal-Basic, 
pβgal-Control, pβgal-Promoter, pβgal-Enhancer, pCMVβ, pTet-Off, pTet-On, 
pTK-Hyg, pRetro-Off, pRetro-On, pIRES1neo, pIRES1hyg, pLXSN, pLNCX, 
pLAPSN, pMAMneo, pMAMneo-CAT, pMAMneo-LUC, pPUR, pSV2neo, 
pYEX 4T-1/2/3, pYEX-S1, pBacPAK-His, pBacPAK8/9, pAcUW31, BacPAK6, 
pTriplEx, λgt10, λgt11, pWE15, and λTriplEx from Clontech; Lambda ZAP II, 
pBK-CMV, pBK-RSV, pBluescript II KS +/-, pBluescript II SK +/-, pAD-GAL4, 
pBD-GAL4 Cam, pSurfscript, Lambda FIX II, Lambda DASH, Lambda EMBL3, 
Lambda EMBL4, SuperCos, pCR-Script Amp, pCR-Script Cam, pCR-Script 
Direct, pBS +/-, pBC KS +/-, pBC SK +/-, Phagescript, pCAL-n-EK, pCAL-n, 
pCAL-c, pCAL-kc, pET-3abcd, pET-11abcd, pSPUTK, pESP-1, pCMVLacI, 
pOPRSVI/MCS, pOPI3 CAT, pXT1, pSG5, pPbac, pMbac, pMC1neo, pMC1neo 
Poly A, pOG44, pOG45, pFRTβGAL, pNEOβGAL, pRS403, pRS404, pRS405, 
pRS406, pRS413, pRS414, pRS415, and pRS416 from Stratagene. 

Two-hybrid and reverse two-hybrid vectors of particular interest include 
pPC86, pDBLeu, pDBTrp, pPC97, p2.5, pGAD1-3, pGAD10, pACt, pACT2, 
pGADGL, pGADGH, pAS2-1, pGAD424, pGBT8, pGBT9, pGAD-GAL4, 
pLexA, pBD-GAL4, pHISi, pHISi-1, placZi, pB42AD, pDG202, pJK202, 
pJG4-5, pNLexA, pYESTrp and variants or derivatives thereof. 

Yeast Expression Vectors of particular interest include pESP-1, pESP-2, 
pESC-His, pESC-Trp, pESC-URA, pESC-Leu (Stratagene), pRS401, pRS402, 
pRS411, pRS412, pRS421, pRS422, and variants or derivatives thereof. 

According to the invention, the vectors comprising one or more nucleic acid 
molecules encoding one or more recombination sites, or mutants, variants, 
fragments, or derivatives thereof, may be produced by one of ordinary skill in the 
art without resorting to undue experimentation using standard molecular biology 
methods. For example, the vectors of the invention may be produced by 
introducing one or more of the nucleic acid molecules encoding one or more 
recombination sites (or mutants, fragments, variants or derivatives thereof) into 
one or more of the vectors described herein, according to the methods described, 
for example, in Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, New York (1982). In a related 
aspect of the invention, the vectors may be engineered to contain, in addition to 
one or more nucleic acid molecules encoding one or more recombination sites (or 
portions thereof), one or more additional physical or functional nucleotide 
sequences, such as those encoding one or more multiple cloning sites, one or more 
transcription termination sites, one or more transcriptional regulatory sequences 
(e.g., one or more promoters, enhancers, or repressors), one or more selection 
markers or modules, one or more genes or portions of genes encoding a protein 
or polypeptide of interest, one or more translational signal sequences, one or more 
nucleotide sequences encoding a fusion partner protein or peptide (e.g., GST, His6 
or thioredoxin), one or more origins of replication, and one or more 5' or 3' 
polynucleotide tails (particularly a poly-G tail). According to this aspect of the 
invention, the one or more recombination site nucleotide sequences (or portions 
thereof) may optionally be operably linked to the one or more additional physical 
or functional nucleotide sequences described herein. 

Preferred vectors according to this aspect of the invention include, but are not 
limited to: pENTR1A (Figures 10A and 10B), pENTR2B (Figures 11A and 11B), 
pENTR3C (Figures 12A and 12B), pENTR4 (Figures 13A and 13B), pENTR5 
(Figures 14A and 14B), pENTR6 (Figures 15A and 15B), pENTR7 (Figures 16A 
and 16B), pENTR8 (Figures 17A and 17B), pENTR9 (Figures 18A and 18B), 
pENTR10 (Figures 19A and 19B), pENTR11 (Figures 20A and 20B), pDEST1 
(Figure 21A-D), pDEST2 (Figure 22A-D), pDEST3 (Figure 23A-D), pDEST4 
(Figure 24A-D), pDEST5 (Figure 25A-D), pDEST6 (Figure 26A-D), pDEST7 
(Figure 27A-C), pDEST8 (Figure 28A-D), pDEST9 (Figure 29A-E), pDEST10 
(Figure 30A-D), pDEST11 (Figure 31A-D), pDEST12.2 (also known as 
pDEST12) (Figure 32A-D), pDEST13 (Figure 33A-C), pDEST14 (Figure 34A- 
D), pDEST15 (Figure 35A-D), pDEST16 (Figure 36A-D), pDEST17 (Figure 
37A-D), pDEST18 (Figure 38A-D), pDEST19 (Figure 39A-D), pDEST20 
(Figure 40A-D), pDEST21 (Figure 41A-E), pDEST22 (Figure 42A-D), pDEST23 
(Figure 43A-D), pDEST24 (Figure 44A-D), pDEST25 (Figure 45A-D), 
pDEST26 (Figure 46A-D), pDEST27 (Figure 47A-D), pEXP501 (also known 
as pCMVSPORT6) (Figure 48A-B), pDONR201 (also known as pENTR21 attP 
vector or pAttPkan Donor Vector) (Figure 49), pDONR202 (Figure 50), 
pDONR203 (also known as pEZ15812) (Figure 51), pDONR204 (Figure 52), 
pDONR205 (Figure 53), pDONR206 (also known as pENTR22 attP vector or 
pAttPgen Donor Vector) (Figure 54), pMAB58 (Figure 87), pMAB62 
(Figure 88), pDEST28 (Figure 90), pDEST29 (Figure 91), pDEST30 (Figure 92), 
pDEST31 (Figure 93), pDEST32 (Figure 94), pDEST33 (Figure 95), pDEST34 
(Figure 96), pDONR207 (Figure 97), pMAB85 (Figure 98), pMAB86 
(Figure 99), and fragments, mutants, variants, and derivatives thereof. However, 
it will be understood by one of ordinary skill that the present invention also 
encompasses other vectors not specifically designated herein, which comprise one 
or more of the isolated nucleic acid molecules of the invention encoding one or 
more recombination sites or portions thereof (or mutants, fragments, variants or 
derivatives thereof), and which may further comprise one or more additional 
physical or functional nucleotide sequences described herein which may optionally 
be operably linked to the one or more nucleic acid molecules encoding one or 
more recombination sites or portions thereof. Such additional vectors may be 
produced by one of ordinary skill according to the guidance provided in the 
present specification. 

Polymerases 

Preferred polypeptides having reverse transcriptase activity (i.e., those 
polypeptides able to catalyze the synthesis of a DNA molecule from an RNA 
template) for use in accordance with the present invention include, but are not 
limited to Moloney Murine Leukemia Virus (M-MLV) reverse transcriptase, Rous 
Sarcoma Virus (RSV) reverse transcriptase, Avian Myeloblastosis Virus (AMV) 
reverse transcriptase, Rous Associated Virus (RAV) reverse transcriptase, 
Myeloblastosis Associated Virus (MAV) reverse transcriptase, Human 
Immunodeficiency Virus (HIV) reverse transcriptase, retroviral reverse 
transcriptase, retrotransposon reverse transcriptase, hepatitis B reverse 
transcriptase, cauliflower mosaic virus reverse transcriptase and bacterial reverse 
transcriptase. Particularly preferred are those polypeptides having reverse 
transcriptase activity that are also substantially reduced in RNase H activity (i.e., 
"RNase H⁻" polypeptides). By a polypeptide that is "substantially reduced in 
RNase H activity" is meant that the polypeptide has less than about 20%, more 
preferably less than about 15%, 10% or 5%, and most preferably less than about 
2%, of the RNase H activity of a wildtype or RNase H⁺ enzyme such as wildtype 
M-MLV reverse transcriptase. The RNase H activity may be determined by a 
variety of assays, such as those described, for example, in U.S. Patent No. 
5,244,797, in Kotewicz, M.L., et al., Nucl. Acids Res. 16:265 (1988) and in 
Gerard, G.F., et al., FOCUS 14(5):91 (1992), the disclosures of all of which are 
fully incorporated herein by reference. Suitable RNase H⁻ polypeptides for use 
in the present invention include, but are not limited to, M-MLV H⁻ reverse 
transcriptase, RSV H⁻ reverse transcriptase, AMV H⁻ reverse transcriptase, RAV 
H⁻ reverse transcriptase, MAV H⁻ reverse transcriptase, HIV H⁻ reverse 
transcriptase, ThermoScript™ reverse transcriptase and ThermoScript™ II 
reverse transcriptase, and SuperScript™ I reverse transcriptase and 
SuperScript™ II reverse transcriptase, which are obtainable, for example, from 
Life Technologies, Inc. (Rockville, Maryland). See generally published PCT 
application WO 98/47912. 

Other polypeptides having nucleic acid polymerase activity suitable for use in 
the present methods include mesophilic DNA polymerases such as DNA 
polymerase I, DNA polymerase III, Klenow fragment, T7 polymerase, and T5 
polymerase, and thermostable DNA polymerases including, but not limited to, 
Thermus thermophilus (Tth) DNA polymerase, Thermus aquaticus (Taq) DNA 
polymerase, Thermotoga neapolitana (Tne) DNA polymerase, Thermotoga 
maritima (Tma) DNA polymerase, Thermococcus litoralis (Tli or VENT®) 
DNA polymerase, Pyrococcus furiosus (Pfu) DNA polymerase, Pyrococcus 
species GB-D (or DEEPVENT®) DNA polymerase, Pyrococcus woesei (Pwo) 
DNA polymerase, Bacillus stearothermophilus (Bst) DNA polymerase, Sulfolobus 
acidocaldarius (Sac) DNA polymerase, Thermoplasma acidophilum (Tac) DNA 
polymerase, Thermus flavus (Tfl/Tub) DNA polymerase, Thermus ruber (Tru) 
DNA polymerase, Thermus brockianus (DYNAZYME®) DNA polymerase, 
Methanobacterium thermoautotrophicum (Mth) DNA polymerase, and mutants, 
variants and derivatives thereof. Such polypeptides are available commercially, 
for example from Life Technologies, Inc. (Rockville, MD), New England BioLabs 
(Beverly, MA), and Sigma/Aldrich (St. Louis, MO). 

Host Cells 

The invention also relates to host cells comprising one or more of the nucleic 
acid molecules or vectors of the invention, particularly those nucleic acid 
molecules and vectors described in detail herein. Representative host cells that 
may be used according to this aspect of the invention include, but are not limited 
to, bacterial cells, yeast cells, plant cells and animal cells. Preferred bacterial host 
cells include Escherichia spp. cells (particularly E. coli cells and most particularly 
E. coli strains DH10B, Stbl2, DH5α, DB3, DB3.1 (preferably E. coli LIBRARY 
EFFICIENCY® DB3.1™ Competent Cells, Life Technologies, Inc., Rockville, 
MD), DB4 and DB5; see U.S. Provisional Application No. 60/122,392, filed on 
March 2, 1999, the disclosure of which is incorporated by reference herein in its 
entirety), Bacillus spp. cells (particularly B. subtilis and B. megaterium cells), 
Streptomyces spp. cells, Erwinia spp. cells, Klebsiella spp. cells, Serratia spp. 
cells (particularly S. marcescens cells), Pseudomonas spp. cells (particularly 
P. aeruginosa cells), and Salmonella spp. cells (particularly S. typhimurium and 
S. typhi cells). Preferred animal host cells include insect cells (most particularly 
Drosophila melanogaster cells, Spodoptera frugiperda Sf9 and Sf21 cells and 
Trichoplusia High-Five cells), nematode cells (particularly C. elegans cells), avian 
cells, amphibian cells (particularly Xenopus laevis cells), reptilian cells, and 
mammalian cells (most particularly CHO, COS, VERO, BHK and human cells). 
Preferred yeast host cells include Saccharomyces cerevisiae cells and Pichia 
pastoris cells. These and other suitable host cells are available commercially, for 
example from Life Technologies, Inc. (Rockville, Maryland), American Type 
Culture Collection (Manassas, Virginia), and Agricultural Research Culture 
Collection (NRRL; Peoria, Illinois). 

Methods for introducing the nucleic acid molecules and/or vectors of the 
invention into the host cells described herein, to produce host cells comprising one 
or more of the nucleic acid molecules and/or vectors of the invention, will be 
familiar to those of ordinary skill in the art. For instance, the nucleic acid 
molecules and/or vectors of the invention may be introduced into host cells using 
well known techniques of infection, transduction, transfection, and transformation. 
The nucleic acid molecules and/or vectors of the invention may be introduced 
alone or in conjunction with other nucleic acid molecules and/or vectors. 
Alternatively, the nucleic acid molecules and/or vectors of the invention may be 
introduced into host cells as a precipitate, such as a calcium phosphate precipitate, 
or in a complex with a lipid. Electroporation also may be used to introduce the 
nucleic acid molecules and/or vectors of the invention into a host. Likewise, such 
molecules may be introduced into chemically competent cells such as E. coli. If 
the vector is a virus, it may be packaged in vitro or introduced into a packaging 
cell and the packaged virus may be transduced into cells. Hence, a wide variety 
of techniques suitable for introducing the nucleic acid molecules and/or vectors of 
the invention into cells in accordance with this aspect of the invention are well 
known and routine to those of skill in the art. Such techniques are reviewed at 
length, for example, in Sambrook, J., et al., Molecular Cloning, A Laboratory 
Manual, 2nd Ed., Cold Spring Harbor, NY: Cold Spring Harbor Laboratory 
Press, pp. 16.30-16.55 (1989), Watson, J.D., et al., Recombinant DNA, 2nd Ed., 
New York: W.H. Freeman and Co., pp. 213-234 (1992), and Winnacker, E.-L., 
From Genes to Clones, New York: VCH Publishers (1987), which are illustrative 
of the many laboratory manuals that detail these techniques and which are 
incorporated by reference herein in their entireties for their relevant disclosures. 

Polypeptides 

In another aspect, the invention relates to polypeptides encoded by the nucleic 
acid molecules of the invention (including polypeptides and amino acid sequences 
encoded by all possible reading frames of the nucleic acid molecules of the 
invention), and to methods of producing such polypeptides. Polypeptides of the 
present invention include purified or isolated natural products, products of 
chemical synthetic procedures, and products produced by recombinant techniques 
from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, 
insect, mammalian, avian and higher plant cells. 

The polypeptides of the invention may be produced by synthetic organic 
chemistry, and are preferably produced by standard recombinant methods, 
employing one or more of the host cells of the invention comprising the vectors 
or isolated nucleic acid molecules of the invention. According to the invention, 
polypeptides are produced by cultivating the host cells of the invention (which 
comprise one or more of the nucleic acid molecules of the invention, preferably 
contained within an Expression Vector) under conditions favoring the expression 
of the nucleotide sequence contained on the nucleic acid molecule of the invention, 
such that the polypeptide encoded by the nucleic acid molecule of the invention 
is produced by the host cell. As used herein, "conditions favoring the expression 
of the nucleotide sequence" or "conditions favoring the production of a 
polypeptide" include optimal physical (e.g., temperature, humidity, etc.) and 
nutritional (e.g., culture medium, ionic) conditions required for production of a 
recombinant polypeptide by a given host cell. Such optimal conditions for a 
variety of host cells, including prokaryotic (bacterial), mammalian, insect, yeast, 
and plant cells will be familiar to one of ordinary skill in the art, and may be found, 
for example, in Sambrook, J., et al., Molecular Cloning, A Laboratory Manual, 
2nd Ed., Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press, (1989), 
Watson, J.D., et al., Recombinant DNA, 2nd Ed., New York: W.H. Freeman and 
Co., and Winnacker, E.-L., From Genes to Clones, New York: VCH Publishers 
(1987). 

In some aspects, it may be desirable to isolate or purify the polypeptides of the 
invention (e.g., for production of antibodies as described below), resulting in the 
production of the polypeptides of the invention in isolated form. The polypeptides 
of the invention can be recovered and purified from recombinant cell cultures by 
well-known methods of protein purification that are routine in the art, including 
ammonium sulfate or ethanol precipitation, acid extraction, anion or cation 
exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography, hydroxylapatite 
chromatography and lectin chromatography. For example, polypeptides made by 
the methods of the invention that bear His6 or GST fusion tags may be isolated 
using appropriate affinity chromatography matrices which bind polypeptides 
bearing His6 or GST tags, as will be familiar to one of ordinary skill in the art. 
Polypeptides of the present invention include naturally purified products, products 
of chemical synthetic procedures, and products produced by recombinant 
techniques from a prokaryotic or eukaryotic host, including, for example, 
bacterial, yeast, higher plant, insect and mammalian cells. Depending upon the 
host employed in a recombinant production procedure, the polypeptides of the 
present invention may be glycosylated or may be non-glycosylated. In addition, 
polypeptides of the invention may also include an initial modified methionine 
residue, in some cases as a result of host-mediated processes. 

Isolated polypeptides of the invention include those comprising the amino acid 
sequences encoded by one or more of the reading frames of the polynucleotides 
comprising one or more of the recombination site-encoding nucleic acid molecules 
of the invention, including those encoding attB1, attB2, attP1, attP2, attL1, attL2, 
attR1 and attR2 having the nucleotide sequences set forth in Figure 9 (or 
nucleotide sequences complementary thereto), or fragments, variants, mutants and 
derivatives thereof; the complete amino acid sequences encoded by the 
polynucleotides contained in the deposited clones described herein; the amino acid 
sequences encoded by polynucleotides which hybridize under stringent 
hybridization conditions to polynucleotides having the nucleotide sequences 
encoding the recombination site sequences of the invention as set forth in Figure 9 
(or a nucleotide sequence complementary thereto); or a peptide or polypeptide 
comprising a portion or a fragment of the above polypeptides. The invention also 
relates to additional polypeptides having one or more additional amino acids linked 
(typically by peptidyl bonds to form a nascent polypeptide) to the polypeptides 
encoded by the recombination site nucleotide sequences or the deposited clones. 
Such additional amino acid residues may comprise one or more functional peptide 
sequences, for example one or more fusion partner peptides (e.g., GST, His6, Trx, 
etc.) and the like. 
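
As an illustration of what "the reading frames" of a recombination site sequence means in practice, the following Python sketch translates a nucleotide sequence in its three forward frames using the standard genetic code. It is an assumption-laden example, not the method of the specification: the function names are hypothetical, and the input is the 25-nucleotide attB1-derived primer portion listed earlier in this section rather than the full sequences of Figure 9, which are not reproduced here.

```python
# Illustrative sketch only; function names are hypothetical.
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq, frame=0):
    """Translate seq (5'->3') starting at offset `frame` (0, 1 or 2).

    Trailing bases that do not form a complete codon are ignored.
    """
    seq = seq.upper()
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


def three_frame_translations(seq):
    """Return the translations of the three forward reading frames."""
    return [translate(seq, frame) for frame in range(3)]


if __name__ == "__main__":
    attb1_portion = "ACAAGTTTGTACAAAAAAGCAGGCT"  # from the primer list in this section
    for frame, peptide in enumerate(three_frame_translations(attb1_portion)):
        print(f"frame {frame}: {peptide}")
```

Translating the complementary strand as well would require reverse-complementing the input first; that step is omitted here for brevity.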

As used herein, the terms "protein," "peptide," "oligopeptide" and 
"polypeptide" are considered synonymous (as is commonly recognized) and each 
term can be used interchangeably as the context requires to indicate a chain of two 
or more amino acids, preferably five or more amino acids, or more preferably ten 
or more amino acids, coupled by (a) peptidyl linkage(s), unless otherwise defined 
in the specific contexts below. As is commonly recognized in the art, all 
polypeptide formulas or sequences herein are written from left to right and in the 
direction from amino terminus to carboxy terminus. 

It will be recognized by those of ordinary skill in the art that some amino acid 
sequences of the polypeptides of the invention can be varied without significant 
effect on the structure or function of the polypeptides. If such differences in 
sequence are contemplated, it should be remembered that there will be critical 
areas on the protein which determine structure and activity. In general, it is 
possible to replace residues which form the tertiary structure, provided that 
residues performing a similar function are used. In other instances, the type of 
residue may be completely unimportant if the alteration occurs at a non-critical 
region of the polypeptide. 

Thus, the invention further includes variants of the polypeptides of the 
invention, including allelic variants, which show substantial structural homology 
to the polypeptides described herein, or which include specific regions of these 
polypeptides such as the portions discussed below. Such mutants may include 
deletions, insertions, inversions, repeats, and type substitutions (for example, 
substituting one hydrophilic residue for another, but not strongly hydrophilic for 
strongly hydrophobic as a rule). Small changes or such "neutral" or "conservative" 
amino acid substitutions will generally have little effect on activity. 

Typical conservative substitutions are the replacements, one for another, 
among the aliphatic amino acids Ala, Val, Leu and Ile; interchange of the 
hydroxylated residues Ser and Thr; exchange of the acidic residues Asp and Glu; 
substitution between the amidated residues Asn and Gln; exchange of the basic 
residues Lys and Arg; and replacements among the aromatic residues Phe and Tyr. 
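
The substitution classes listed above can be captured in a small lookup, sketched here in Python as an illustration only. The group contents are taken from the paragraph above (using the standard three-letter codes); the data structure and function name are hypothetical, and the check is deliberately limited to these six classes rather than any broader substitution matrix.

```python
# Illustrative sketch only; names are hypothetical.

CONSERVATIVE_GROUPS = (
    frozenset({"Ala", "Val", "Leu", "Ile"}),  # aliphatic
    frozenset({"Ser", "Thr"}),                # hydroxylated
    frozenset({"Asp", "Glu"}),                # acidic
    frozenset({"Asn", "Gln"}),                # amidated
    frozenset({"Lys", "Arg"}),                # basic
    frozenset({"Phe", "Tyr"}),                # aromatic
)


def is_conservative_substitution(original, replacement):
    """True if both residues differ and fall within one group listed above."""
    if original == replacement:
        return False  # no substitution took place
    return any(original in group and replacement in group
               for group in CONSERVATIVE_GROUPS)


if __name__ == "__main__":
    print(is_conservative_substitution("Leu", "Ile"))  # same aliphatic group
    print(is_conservative_substitution("Lys", "Asp"))  # basic vs. acidic
```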

Thus, the fragment, derivative or analog of the polypeptides of the invention, 
such as those comprising peptides encoded by the recombination site nucleotide 
sequences described herein, may be (i) one in which one or more of the amino acid 
residues are substituted with a conservative or non-conservative amino acid 
residue (preferably a conservative amino acid residue), and such substituted amino 
acid residue may be encoded by the genetic code or may be an amino acid (e.g., 
desmosine, citrulline, ornithine, etc.) that is not encoded by the genetic code; (ii) 
one in which one or more of the amino acid residues includes a substituent group 
(e.g., a phosphate, hydroxyl, sulfate or other group) in addition to the normal "R" 
group of the amino acid; (iii) one in which the mature polypeptide is fused with 
another compound, such as a compound to increase the half-life of the polypeptide 
(for example, polyethylene glycol), or (iv) one in which additional amino acids are 
fused to the mature polypeptide, such as an immunoglobulin Fc region peptide, a 
leader or secretory sequence, a sequence which is employed for purification of the 
mature polypeptide (such as GST) or a proprotein sequence. Such fragments, 
derivatives and analogs are intended to be encompassed by the present invention, 
and are within the scope of those skilled in the art from the teachings herein and 
the state of the art at the time of invention. 

The polypeptides of the present invention are preferably provided in an 
isolated form, and preferably are substantially purified. Recombinantly produced 
versions of the polypeptides of the invention can be substantially purified by the 
one-step method described in Smith and Johnson, Gene 67:31-40 (1988). As used 
herein, the term "substantially purified" means a preparation of an individual 
polypeptide of the invention wherein at least 50%, preferably at least 60%, 70%, 
or 75% and more preferably at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98% or 99% (by mass) of contaminating proteins (i.e., those 
that are not the individual polypeptides described herein or fragments, variants, 
mutants or derivatives thereof) have been removed from the preparation. 
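
Read literally, the definition above compares the mass of contaminating protein before and after purification. A minimal Python sketch of that arithmetic follows; the function names and the example masses are hypothetical, and how the contaminant masses are measured is outside the scope of the sketch.

```python
# Illustrative sketch only; names and example values are hypothetical.

def fraction_contaminants_removed(mass_before, mass_after):
    """Fraction (by mass) of contaminating protein removed from a preparation."""
    if mass_before <= 0:
        raise ValueError("mass_before must be positive")
    if not 0 <= mass_after <= mass_before:
        raise ValueError("mass_after must lie between 0 and mass_before")
    return 1.0 - mass_after / mass_before


def is_substantially_purified(mass_before, mass_after, threshold=0.50):
    """Apply the 'at least 50%' floor; pass a higher threshold for stricter tiers."""
    return fraction_contaminants_removed(mass_before, mass_after) >= threshold


if __name__ == "__main__":
    # e.g. 10.0 mg of contaminating protein reduced to 4.0 mg
    print(is_substantially_purified(10.0, 4.0))
```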

The polypeptides of the present invention include those which are at least 
about 50% identical, at least 60% identical, at least 65% identical, more preferably 
at least about 70%, at least about 75%, at least about 80%, at least about 85%, 
at least about 90%, at least about 95%, at least about 96%, at least about 97%, 
at least about 98% or at least about 99% identical, to the polypeptides described 
herein. For example, preferred attB1-containing polypeptides of the invention 
include those that are at least about 50% identical, at least 60% identical, at least 
65% identical, more preferably at least about 70%, at least about 75%, at least 
about 80%, at least about 85%, at least about 90%, at least about 95%, at least 
about 96%, at least about 97%, at least about 98% or at least about 99% identical, 
to the polypeptide(s) encoded by the three reading frames of a polynucleotide 
comprising a nucleotide sequence of attB1 having a nucleic acid sequence as set 
forth in Figure 9 (or a nucleic acid sequence complementary thereto), to a 
polypeptide encoded by a polynucleotide contained in the deposited cDNA clones 
described herein, or to a polypeptide encoded by a polynucleotide hybridizing 
under stringent conditions to a polynucleotide comprising a nucleotide sequence 
of attB1 having a nucleic acid sequence as set forth in Figure 9 (or a nucleic acid 
sequence complementary thereto). Analogous polypeptides may be prepared that 
are at least about 65% identical, more preferably at least about 70%, at least about 
75%, at least about 80%, at least about 85%, at least about 90%, at least about 
95%, at least about 96%, at least about 97%, at least about 98% or at least about 
99% identical, to the attB2, attP1, attP2, attL1, attL2, attR1 and attR2 
polypeptides of the invention as depicted in Figure 9. The present polypeptides 
also include portions or fragments of the above-described polypeptides with at 
least 5, 10, 15, 20, or 25 amino acids. 

By a polypeptide having an amino acid sequence at least, for example, 65% 
"identical" to a reference amino acid sequence of a given polypeptide of the 
invention is intended that the amino acid sequence of the polypeptide is identical 
to the reference sequence except that the polypeptide sequence may include up to 
35 amino acid alterations per each 100 amino acids of the reference amino acid 
sequence of a given polypeptide of the invention. In other words, to obtain a 
polypeptide having an amino acid sequence at least 65% identical to a reference 
amino acid sequence, up to 35% of the amino acid residues in the reference 
sequence may be deleted or substituted with another amino acid, or a number of 
amino acids up to 35% of the total amino acid residues in the reference sequence 
may be inserted into the reference sequence. These alterations of the reference 
sequence may occur at the amino (N-) or carboxy (C-) terminal positions of the 
reference amino acid sequence or anywhere between those terminal positions, 
interspersed either individually among residues in the reference sequence or in one 
or more contiguous groups within the reference sequence. As a practical matter, 
whether a given amino acid sequence is, for example, at least 65% identical to the 
amino acid sequence of a given polypeptide of the invention can be determined 
conventionally using known computer programs such as those described above for 
nucleic acid sequence identity determinations, or more preferably using the 
CLUSTAL W program (Thompson, J.D., et al., Nucleic Acids Res. 22:4673-4680 
(1994)). 
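
The arithmetic of the identity definition above can be sketched as follows. This is a minimal illustration, not the CLUSTAL W algorithm: it assumes the two sequences have already been aligned (for example by an alignment program) and are supplied as equal-length strings in which "-" marks a gap. Each substituted, deleted, or inserted residue counts as one alteration, and the total is expressed relative to the length of the reference sequence. The function names are hypothetical.

```python
def percent_identity(aligned_reference, aligned_query):
    """Percent identity of a query to a reference, given a pairwise alignment.

    Both arguments are equal-length strings with '-' for gaps. A column counts
    as an alteration when the two characters differ (substitution, deletion
    from the reference, or insertion into the reference).
    """
    if len(aligned_reference) != len(aligned_query):
        raise ValueError("aligned sequences must have the same length")
    reference_length = sum(1 for r in aligned_reference if r != "-")
    if reference_length == 0:
        raise ValueError("reference sequence is empty")
    alterations = sum(
        1 for r, q in zip(aligned_reference, aligned_query) if r != q
    )
    identical = max(0, reference_length - alterations)
    return 100.0 * identical / reference_length


def is_at_least_identical(aligned_reference, aligned_query, threshold=65.0):
    """True when the query meets the stated identity threshold (in percent)."""
    return percent_identity(aligned_reference, aligned_query) >= threshold
```

Under this sketch, a 100-residue reference aligned to a query with 35 alterations yields 65%, which matches the "up to 35 amino acid alterations per each 100 amino acids" wording above.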

The polypeptides of the present invention can be used as molecular weight 
markers on SDS-PAGE gels or on molecular sieve gel filtration columns using 
methods well known to those of skill in the art. In addition, as described in detail 
below, the polypeptides of the present invention can be used to raise polyclonal 
and monoclonal antibodies which are useful in a variety of assays for detecting 
protein expression, localization, detection of interactions with other molecules, or 
for the isolation of a polypeptide (including a fusion polypeptide) of the invention. 

In another aspect, the present invention provides a peptide or polypeptide 
comprising an epitope-bearing portion of a polypeptide of the invention, which 
may be used to raise antibodies, particularly monoclonal antibodies, that bind 
specifically to one or more of the polypeptides of the invention. The epitope of 
this polypeptide portion is an immunogenic or antigenic epitope of a polypeptide 
of the invention. An "immunogenic epitope" is defined as a part of a protein that 
elicits an antibody response when the whole protein is the immunogen. These 
immunogenic epitopes are believed to be confined to a few loci on the molecule. 
On the other hand, a region of a protein molecule to which an antibody can bind 
is defined as an "antigenic epitope." The number of immunogenic epitopes of a 
protein generally is less than the number of antigenic epitopes (see, e.g., Geysen 
et al., Proc. Natl. Acad. Sci. USA 81:3998-4002 (1983)). 

As to the selection of peptides or polypeptides bearing an antigenic epitope 
(i.e., that contain a region of a protein molecule to which an antibody can bind), 
it is well-known in the art that relatively short synthetic peptides that mimic part 
of a protein sequence are routinely capable of eliciting an antiserum that reacts 
with the partially mimicked protein (see, e.g., Sutcliffe, J.G., et al., Science 
219:660-666 (1983)). Peptides capable of eliciting protein-reactive sera are 
frequently represented in the primary sequence of a protein, can be characterized 
by a set of simple chemical rules, and are not confined to the immunodominant 
regions of intact proteins (i.e., immunogenic epitopes) or to the amino or carboxy 
termini. Peptides that are extremely hydrophobic and those of six or fewer 
residues generally are ineffective at inducing antibodies that bind to the mimicked 
protein; longer peptides, especially those containing proline residues, usually are 
effective (Sutcliffe, J.G., et al., Science 219:660-666 (1983)). 

Epitope-bearing peptides and polypeptides of the invention designed according 
to the above guidelines preferably contain a sequence of at least five, more 
preferably at least seven or more amino acids contained within the amino acid 
sequence of a polypeptide of the invention. However, peptides or polypeptides 
comprising a larger portion of an amino acid sequence of a polypeptide of the 
invention, containing about 30 to about 50 amino acids, or any length up to and 
including the entire amino acid sequence of a given polypeptide of the invention, 
also are considered epitope-bearing peptides or polypeptides of the invention and 
also are useful for inducing antibodies that react with the mimicked protein. 
Preferably, the amino acid sequence of the epitope-bearing peptide is selected to 
provide substantial solubility in aqueous solvents (i.e., the sequence includes 
relatively hydrophilic residues and highly hydrophobic sequences are preferably 
avoided); sequences containing proline residues are particularly preferred. 
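
The selection guidelines above (a minimum length, avoidance of highly hydrophobic sequences, and a preference for proline-containing sequences) can be sketched as a simple filter. The set of residues treated as hydrophobic and the 0.5 cutoff are assumptions chosen for illustration; the text above does not specify numerical thresholds, and the function name is hypothetical.

```python
# Residues treated as hydrophobic for this sketch (an assumption, not from the text).
HYDROPHOBIC = set("AVILMFWC")


def candidate_peptides(protein, length=7, max_hydrophobic_fraction=0.5):
    """List windows of the given length that are not predominantly hydrophobic.

    Returns (start_index, peptide, contains_proline) tuples, with
    proline-containing peptides listed first and original order otherwise kept.
    """
    protein = protein.upper()
    candidates = []
    for start in range(len(protein) - length + 1):
        window = protein[start:start + length]
        fraction = sum(res in HYDROPHOBIC for res in window) / length
        if fraction <= max_hydrophobic_fraction:
            candidates.append((start, window, "P" in window))
    # Stable sort: proline-containing windows (key False) come first.
    candidates.sort(key=lambda item: not item[2])
    return candidates
```

The default window of seven residues follows the "at least seven" preference stated above; longer windows (for example 30 to 50 residues) can be requested through the `length` argument.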

Non-limiting examples of epitope-bearing polypeptides or peptides that can be 
used to generate antibodies specific for the polypeptides of the invention include 
certain epitope-bearing regions of the polypeptides comprising amino acid 
sequences encoded by polynucleotides comprising one or more of the 
recombination site-encoding nucleic acid molecules of the invention, including 
those encoding attB1, attB2, attP1, attP2, attL1, attL2, attR1 and attR2 having 
the nucleotide sequences set forth in Figure 9 (or a nucleotide sequence 
complementary thereto); the complete amino acid sequences encoded by the three 
reading frames of the polynucleotides contained in the deposited clones described 
herein; and the amino acid sequences encoded by all reading frames of 
polynucleotides which hybridize under stringent hybridization conditions to 
polynucleotides having the nucleotide sequences encoding the recombination site 
sequences (or portions thereof) of the invention as set forth in Figure 9 (or a 
nucleic acid sequence complementary thereto). Other epitope-bearing polypeptides 
or peptides that may be used to generate antibodies specific for the polypeptides 
of the invention will be apparent to one of ordinary skill in the art based on the 
primary amino acid sequences of the polypeptides of the invention described 
herein, via the construction of Kyte-Doolittle hydrophilicity and Jameson-Wolf 
antigenic index plots of the polypeptides of the invention using, for example, 
PROTEAN computer software (DNASTAR, Inc.; Madison, Wisconsin). 
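
For readers without access to the PROTEAN software, the idea behind a Kyte-Doolittle hydrophilicity plot can be sketched in a few lines of Python. The sketch averages the published Kyte-Doolittle hydropathy values over a sliding window and negates the result, so that higher scores indicate more hydrophilic regions. The window size of seven and the function names are assumptions for illustration; the Jameson-Wolf antigenic index combines several additional predictors and is not reproduced here.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydrophilicity_profile(protein, window=7):
    """Return (start_index, score) pairs; a higher score is more hydrophilic.

    Raises KeyError if the sequence contains a non-standard residue.
    """
    protein = protein.upper()
    profile = []
    for start in range(len(protein) - window + 1):
        segment = protein[start:start + window]
        mean_hydropathy = sum(KYTE_DOOLITTLE[res] for res in segment) / window
        profile.append((start, -mean_hydropathy))
    return profile


def most_hydrophilic_window(protein, window=7):
    """Return (start_index, peptide, score) for the most hydrophilic window."""
    profile = hydrophilicity_profile(protein, window)
    if not profile:
        raise ValueError("sequence is shorter than the window")
    start, score = max(profile, key=lambda item: item[1])
    return start, protein.upper()[start:start + window], score
```

Plotting the scores against the start index gives a profile whose peaks suggest surface-exposed, potentially antigenic regions; such predictions are only a starting point for peptide selection.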

The epitope-bearing peptides and polypeptides of the invention may be 
produced by any conventional means for making peptides or polypeptides 
including recombinant means using nucleic acid molecules of the invention. For 
instance, a short epitope-bearing amino acid sequence may be fused to a larger 
polypeptide which acts as a carrier during recombinant production and 
purification, as well as during immunization to produce anti-peptide antibodies. 
Epitope-bearing peptides also may be synthesized using known methods of 
chemical synthesis (see, e.g., U.S. Patent No. 4,631,211 and Houghten, R.A., 
Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985), both of which are incorporated 
by reference herein in their entireties). 

As one of skill in the art will appreciate, the polypeptides of the present 
invention and epitope-bearing fragments thereof may be immobilized onto a solid 
support, by techniques that are well-known and routine in the art. By "solid 
support" is intended any solid support to which a peptide can be immobilized. 
Such solid supports include, but are not limited to nitrocellulose, diazocellulose, 
glass, polystyrene, polyvinylchloride, polypropylene, polyethylene, dextran, 
Sepharose, agar, starch, nylon, beads and microtitre plates. Linkage of the peptide 
of the invention to a solid support can be accomplished by attaching one or both 
ends of the peptide to the support. Attachment may also be made at one or more 
internal sites in the peptide. Multiple attachments (both internal and at the ends of 
the peptide) may also be used according to the invention. Attachment can be via 
an amino acid linkage group such as a primary amino group, a carboxyl group, or 
a sulfhydryl (SH) group or by chemical linkage groups such as with cyanogen 
bromide (CNBr) linkage through a spacer. For non-covalent attachments to the 
support, addition of an affinity tag sequence to the peptide can be used such as 
GST (Smith, D.B., and Johnson, K.S., Gene 67:31 (1988)), polyhistidines 
(Hochuli, E., et al., J. Chromatog. 411:11 (1987)), or biotin. Such affinity tags 
may be used for the reversible attachment of the peptide to the support. Such 
immobilized polypeptides or fragments may be useful, for example, in isolating 
antibodies directed against one or more of the polypeptides of the invention, or 
other proteins or peptides that recognize other proteins or peptides that bind to 
one or more of the polypeptides of the invention, as described below. 

As one of skill in the art will also appreciate, the polypeptides of the present 
invention and the epitope-bearing fragments thereof described herein can be 
combined with one or more fusion partner proteins or peptides, or portions 
thereof, including but not limited to GST, His6, Trx, and portions of the constant 
domain of immunoglobulins (Ig), resulting in chimeric or fusion polypeptides. 
These fusion polypeptides facilitate purification of the polypeptides of the 
invention (EP 0 394 827; Traunecker et al., Nature 331:84-86 (1988)) for use in 
analytical or diagnostic (including high-throughput) format. 

Antibodies 

In another aspect, the invention relates to antibodies that recognize and bind 
to the polypeptides (or epitope-bearing fragments thereof) or nucleic acid 
molecules (or portions thereof) of the invention. In a related aspect, the invention 
relates to antibodies that recognize and bind to one or more polypeptides encoded 
by all reading frames of one or more recombination site nucleic acid sequences or 
portions thereof, or to one or more nucleic acid molecules comprising one or more 
recombination site nucleic acid sequences or portions thereof, including but not 
limited to att sites (including attB1, attB2, attP1, attP2, attL1, attL2, attR1, attR2 
and the like), lox sites (e.g., loxP, loxP511, and the like), FRT, and the like, or 
mutants, fragments, variants and derivatives thereof. See generally U.S. Patent 
No. 5,888,732, which is incorporated herein by reference in its entirety. The 
antibodies of the present invention may be polyclonal or monoclonal, and may be 
prepared by any of a variety of methods and in a variety of species according to 
methods that are well-known in the art. See, for instance, U.S. Patent No. 
5,587,287; Sutcliffe, J.G., et al., Science 219:660-666 (1983); Wilson et al., Cell 
37:767 (1984); and Bittle, F.J., et al., J. Gen. Virol. 66:2347-2354 (1985). 
Antibodies specific for any of the polypeptides or nucleic acid molecules described 
herein, such as antibodies specifically binding to one or more of the polypeptides 
encoded by the recombination site nucleotide sequences, or one or more nucleic 
acid molecules, described herein or contained in the deposited clones, antibodies 
against fusion polypeptides (e.g., binding to fusion polypeptides between one or 
more of the fusion partner proteins and one or more of the recombination site 
polypeptides of the invention, as described herein), and the like, can be raised 
against the intact polypeptides or polynucleotides of the invention or one or more 
antigenic polypeptide fragments thereof. 

As used herein, the term "antibody" (Ab) may be used interchangeably with 
the terms "polyclonal antibody" or "monoclonal antibody" (mAb), except in 
specific contexts as described below. These terms, as used herein, are meant to 
include intact molecules as well as antibody fragments (such as, for example, Fab 
and F(ab')2 fragments) which are capable of specifically binding to a polypeptide 
or nucleic acid molecule of the invention or a portion thereof. It will therefore be 
appreciated that, in addition to the intact antibodies of the invention, Fab, F(ab')2 
and other fragments of the antibodies described herein, and other peptides and 
peptide fragments that bind one or more polypeptides or polynucleotides of the 
invention, are also encompassed within the scope of the invention. Such antibody 
fragments are typically produced by proteolytic cleavage of intact antibodies, using 
enzymes such as papain (to produce Fab fragments) or pepsin (to produce F(ab')2 
fragments). Antibody fragments, and peptides or peptide fragments, may also be 
produced through the application of recombinant DNA technology or through 
synthetic chemistry. 

Epitope-bearing peptides and polypeptides, and nucleic acid molecules or 
portions thereof, of the invention may be used to induce antibodies according to 
methods well known in the art, as generally described herein (see, e.g., Sutcliffe, 
et al., supra; Wilson, et al., supra; and Bittle, F.J., et al., J. Gen. Virol. 
66:2347-2354 (1985)). 

Polyclonal antibodies according to this aspect of the invention may be made 
by immunizing an animal with one or more of the polypeptides or nucleic acid 
molecules of the invention described herein or portions thereof according to 
standard techniques (see, e.g., Harlow, E., and Lane, D., Antibodies: A 
Laboratory Manual, Cold Spring Harbor, NY: Cold Spring Harbor Laboratory 
Press (1988); Kaufman, P.B., et al., In: Handbook of Molecular and Cellular 
Methods in Biology and Medicine, Boca Raton, Florida: CRC Press, pp. 468-469 
(1995)). For producing antibodies that recognize and bind to the polypeptides or 
nucleic acid molecules of the invention or portions thereof, animals may be 
immunized with free peptide or free nucleic acid molecules; however, antibody 
titer may be boosted by coupling of the peptide to a macromolecular carrier, such 
as albumin, KLH, or tetanus toxoid (particularly for producing antibodies against 
the nucleic acid molecules of the invention or portions thereof; see Harlow and 
Lane, supra, at page 154), or to a solid phase carrier such as a latex or glass 
microbead. For instance, peptides containing cysteine may be coupled to carrier 
using a linker such as m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), 
while other peptides may be coupled to carrier using a more general linking agent 
such as glutaraldehyde. Animals such as rabbits, rats and mice may be immunized 
with either free (if the polypeptide immunogen is larger than about 25 amino acids 
in length) or carrier-coupled peptides or nucleic acid molecules, for instance, by 
intraperitoneal and/or intradermal injection of emulsions containing about 100 µg 
peptide, polynucleotide, or carrier protein, and Freund's adjuvant. Several booster 
injections may be needed, for instance, at intervals of about two weeks, to provide 
a useful titer of antibody which can be detected, for example, by ELISA assay 
using free peptide or nucleic acid molecule adsorbed to a solid surface. In another 
approach, cells expressing one or more of the polypeptides or polynucleotides of 
the invention or an antigenic fragment thereof can be administered to an animal in 
order to induce the production of sera containing polyclonal antibodies, according 
to routine immunological methods. In yet another method, a preparation of one 
or more of the polypeptides or polynucleotides of the invention is prepared and 
purified as described herein, to render it substantially free of natural contaminants. 
Such a preparation may then be introduced into an animal in order to produce 
polyclonal antisera of greater specific activity. The titer of antibodies in serum 
from an immunized animal, regardless of the method of immunization used, may 
be increased by selection of anti-peptide or anti-polynucleotide antibodies, for 
instance, by adsorption to the peptide or polynucleotide on a solid support and 
elution of the selected antibodies according to methods well known in the art. 

In an alternative method, the antibodies of the present invention are 
monoclonal antibodies (or fragments thereof which bind to one or more of the 
polypeptides of the invention). Such monoclonal antibodies can be prepared using 
hybridoma technology (Kohler et al., Nature 256:495 (1975); Kohler et al., Eur. 
J. Immunol. 6:511 (1976); Kohler et al., Eur. J. Immunol. 6:292 (1976); 
Hammerling et al., In: Monoclonal Antibodies and T-Cell Hybridomas, Elsevier, 
N.Y., pp. 563-681 (1981)). In general, such procedures involve immunizing an 
animal (preferably a mouse) with a polypeptide or polynucleotide of the invention 
(or a fragment thereof), or with a cell expressing a polypeptide or polynucleotide 
of the invention (or a fragment thereof). The splenocytes of such mice are 
extracted and fused with a suitable myeloma cell line. Any suitable myeloma cell 
line may be employed in accordance with the present invention; however, it is 
preferable to employ the parent myeloma cell line (SP2O), available from the 
American Type Culture Collection, Rockville, Maryland. After fusion, the 
resulting hybridoma cells are selectively maintained in HAT medium, and then 
cloned by limiting dilution as described by Wands et al. (Gastroenterology 
80:225-232 (1981)). The hybridoma cells obtained through such a selection are 
then assayed to identify clones which secrete antibodies capable of binding one or 
more of the polypeptides or nucleic acid molecules of the invention, or fragments 
thereof. Hence, the present invention also provides hybridoma cells and cell lines 
producing monoclonal antibodies of the invention, particularly those that recognize and 
bind to one or more of the polypeptides or nucleic acid molecules of the invention. 

Alternatively, additional antibodies capable of binding to one or more of the 
polypeptides of the invention, or fragments thereof, may be produced in a 
two-step procedure through the use of anti-idiotypic antibodies. Such a method 
makes use of the fact that antibodies are themselves antigens, and that, therefore, 
it is possible to obtain an antibody which binds to a second antibody. In 
accordance with this method, antibodies specific for one or more of the 
polypeptides or polynucleotides of the invention, prepared as described above, are 
used to immunize an animal, preferably a mouse. The splenocytes of such an 
animal are then used to produce hybridoma cells, and the hybridoma cells are 
screened to identify clones which produce an antibody whose ability to bind to an 
antibody specific for one or more of the polypeptides or polynucleotides of the 
invention can be blocked by polypeptides of the invention themselves. Such 
antibodies comprise anti-idiotypic antibodies to the antibodies recognizing one or 
more of the polypeptides or polynucleotides of the invention, and can be used to 
immunize an animal to induce formation of further antibodies specific for one or 
more of the polypeptides or polynucleotides of the invention. 

For use, the antibodies of the invention may optionally be detectably labeled 
by covalent or non-covalent attachment of one or more labels, including but not 
limited to chromogenic, enzymatic, radioisotopic, isotopic, fluorescent, toxic, 
chemiluminescent, or nuclear magnetic resonance contrast agents or other labels. 

Examples of suitable enzyme labels include malate dehydrogenase, 
staphylococcal nuclease, delta-5-steroid isomerase, yeast-alcohol dehydrogenase, 
alpha-glycerol phosphate dehydrogenase, triose phosphate isomerase, peroxidase, 
alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase, 
ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase, 
and acetylcholine esterase. 

Examples of suitable radioisotopic labels include ³H, ¹¹¹In, ¹²⁵I, ¹³¹I, ³²P, ³⁵S, 
¹⁴C, ⁵¹Cr, ⁵⁷To, ⁵⁸Co, ⁵⁹Fe, ⁷⁵Se, ¹⁵²Eu, ⁹⁰Y, ⁶⁷Cu, ²¹⁷Ci, ²¹¹At, ²¹²Pb, ⁴⁷Sc, ¹⁰⁹Pd, etc. 
¹¹¹In is a preferred isotope where in vivo imaging is used since it avoids the 
problem of dehalogenation of the ¹²⁵I- or ¹³¹I-labeled monoclonal antibody by the 
liver. In addition, this radionuclide has a more favorable gamma emission 
energy for imaging (Perkins et al., Eur. J. Nucl. Med. 10:296-301 (1985); 
Carasquillo et al., J. Nucl. Med. 28:281-287 (1987)). For example, ¹¹¹In coupled 
to monoclonal antibodies with 1-(p-isothiocyanatobenzyl)-DTPA has shown little 
uptake in non-tumorous tissues, particularly the liver, and therefore enhances 
specificity of tumor localization (Esteban et al., J. Nucl. Med. 28:861-870 (1987)). 

Examples of suitable non-radioactive isotopic labels include ¹⁵⁷Gd, ⁵⁵Mn, ¹⁶²Dy, 
⁵²Tr, and ⁵⁶Fe. 

Examples of suitable fluorescent labels include a ¹⁵²Eu label, a fluorescein 
label, an isothiocyanate label, a rhodamine label, a phycoerythrin label, a 
phycocyanin label, an allophycocyanin label, an o-phthaldehyde label, a green 
fluorescent protein (GFP) label, and a fluorescamine label. 

Examples of suitable toxin labels include diphtheria toxin, ricin, and cholera 
toxin. 

Examples of chemiluminescent labels include a luminol label, an isoluminol 
label, an aromatic acridinium ester label, an imidazole label, an acridinium salt 
label, an oxalate ester label, a luciferin label, a luciferase label, and an aequorin 
label. 

Examples of nuclear magnetic resonance contrasting agents include heavy 
metal nuclei such as Gd, Mn, and iron. 

Typical techniques for binding the above-described labels to the antibodies of 
the invention are provided by Kennedy et al., Clin. Chim. Acta 70:1-31 (1976), 
and Schurs et al., Clin. Chim. Acta 81:1-40 (1977). Coupling techniques 
mentioned in the latter are the glutaraldehyde method, the periodate method, the 
dimaleimide method, and the m-maleimidobenzyl-N-hydroxy-succinimide ester 
method, all of which methods are incorporated by reference herein. 

It will be appreciated by one of ordinary skill that the antibodies of the present 
invention may alternatively be coupled to a solid support, to facilitate, for 
example, chromatographic and other immunological procedures using such solid 
phase-immobilized antibodies. Included among such procedures are the use of the 
antibodies of the invention to isolate or purify polypeptides comprising one or 
more epitopes encoded by the nucleic acid molecules of the invention (which may 
be fusion polypeptides or other polypeptides of the invention described herein), 
or to isolate or purify polynucleotides comprising one or more recombination site 
sequences of the invention or portions thereof. Methods for isolation and 
purification of polypeptides (and, by analogy, polynucleotides) by affinity 
chromatography, for example using the antibodies of the invention coupled to a 
solid phase support, are well-known in the art and will be familiar to one of 
ordinary skill. The antibodies of the invention may also be used in other 
applications, for example to cross-link or couple two or more proteins, 
polypeptides, polynucleotides, or portions thereof into a structural and/or 
functional complex. In one such use, an antibody of the invention may have two 
or more distinct epitope-binding regions that may bind, for example, a first 
polypeptide (which may be a polypeptide of the invention) at one epitope-binding 
region on the antibody and a second polypeptide (which may be a polypeptide of 
the invention) at a second epitope-binding region on the antibody, thereby bringing 
the first and second polypeptides into close proximity to each other such that the 
first and second polypeptides are able to interact structurally and/or functionally 
(as, for example, linking an enzyme and its substrate to carry out enzymatic 
catalysis, or linking an effector molecule and its receptor to carry out or induce a 
specific binding of the effector molecule to the receptor or a response to the 
effector molecule mediated by the receptor). Additional applications for the 
antibodies of the invention include, for example, the preparation of large-scale 
arrays of the antibodies, polypeptides, or nucleic acid molecules of the invention, 
or portions thereof, on a solid support, for example to facilitate high-throughput 
screening of protein or RNA expression by host cells containing nucleic acid 
molecules of the invention (known in the art as "chip array" protocols; see, e.g., 
U.S. Patent Nos. 5,856,101, 5,837,832, 5,770,456, 5,744,305, 5,631,734, and 
5,593,839, which are directed to production and use of chip arrays of polypeptides 
(including antibodies) and polynucleotides, and the disclosures of which are 
incorporated herein by reference in their entireties). By "solid support" is intended 
any solid support to which an antibody can be immobilized. Such solid supports 
include, but are not limited to nitrocellulose, diazocellulose, glass, polystyrene, 
polyvinylchloride, polycarbonate, polypropylene, polyethylene, dextran, 
Sepharose, agar, starch, nylon, beads and microtitre plates. Preferred are beads 
made of glass, latex or a magnetic material. Linkage of an antibody of the 
invention to a solid support can be accomplished by attaching one or both ends of 
the antibody to the support. Attachment may also be made at one or more internal 
sites in the antibody. Multiple attachments (both internal and at the ends of the 
antibody) may also be used according to the invention. Attachment can be via an 
amino acid linkage group such as a primary amino group, a carboxyl group, or a 
sulfhydryl (SH) group or by chemical linkage groups such as with cyanogen 
bromide (CNBr) linkage through a spacer. For non-covalent attachments, 
addition of an affinity tag sequence to the peptide can be used such as GST 
(Smith, D.B., and Johnson, K.S., Gene 67:31 (1988)), polyhistidines (Hochuli, E., 
et al., J. Chromatog. 411:11 (1987)), or biotin. Alternatively, attachment can be 
accomplished using a ligand which binds the Fc region of the antibodies of the 
invention, e.g., protein A or protein G. Such affinity tags may be used for the 
reversible attachment of the antibodies to the support. Peptides may also be 
recognized via specific ligand-receptor interactions or using phage display 
methodologies that will be familiar to the skilled artisan, for their ability to bind 
polypeptides of the invention or fragments thereof. 

Kits 

In another aspect, the invention provides kits which may be used in producing 
the nucleic acid molecules, polypeptides, vectors, host cells, and antibodies, and 
in the recombinational cloning methods, of the invention. Kits according to this 
aspect of the invention may comprise one or more containers, which may contain 
one or more of the nucleic acid molecules, primers, polypeptides, vectors, host 
cells, or antibodies of the invention. In particular, a kit of the invention may 
comprise one or more components (or combinations thereof) selected from the 
group consisting of one or more recombination proteins (e.g., Int) or auxiliary 
factors (e.g., IHF and/or Xis) or combinations thereof, one or more compositions 
comprising one or more recombination proteins or auxiliary factors or 
combinations thereof (for example, GATEWAY™ LR Clonase™ Enzyme Mix or 
GATEWAY™ BP Clonase™ Enzyme Mix), one or more Destination Vector 
molecules (including those described herein), one or more Entry Clone or Entry 
Vector molecules (including those described herein), one or more primer nucleic 
acid molecules (particularly those described herein), one or more host cells (e.g., 
competent cells, such as E. coli cells, yeast cells, animal cells (including 
mammalian cells, insect cells, nematode cells, avian cells, fish cells, etc.), plant 
cells, and most particularly E. coli DB3, DB3.1 (preferably E. coli LIBRARY 
EFFICIENCY® DB3.1™ Competent Cells; Life Technologies, Inc., Rockville, 
MD), DB4 and DB5; see U.S. Provisional Application No. 60/122,392, filed on 
March 2, 1999, and the corresponding U.S. Utility Application No. __________ 
of Hartley et al., entitled "Cells Resistant to Toxic Genes and Uses Thereof," filed 
on even day herewith, the disclosures of which are incorporated by reference 
herein in their entireties), and the like. In related aspects, the kits of the invention may 
comprise one or more nucleic acid molecules encoding one or more recombination 
sites or portions thereof, such as one or more nucleic acid molecules comprising 
a nucleotide sequence encoding the one or more recombination sites (or portions 
thereof) of the invention, and particularly one or more of the nucleic acid 
molecules contained in the deposited clones described herein. Kits according to 
this aspect of the invention may also comprise one or more isolated nucleic acid 
molecules of the invention, one or more vectors of the invention, one or more 
primer nucleic acid molecules of the invention, and/or one or more antibodies of 
the invention. The kits of the invention may further comprise one or more 
additional containers containing one or more additional components useful in 
combination with the nucleic acid molecules, polypeptides, vectors, host cells, or 
antibodies of the invention, such as one or more buffers, one or more detergents, 
one or more polypeptides having nucleic acid polymerase activity, one or more 
polypeptides having reverse transcriptase activity, one or more transfection 
reagents, one or more nucleotides, and the like. Such kits may be used in any 
process advantageously using the nucleic acid molecules, primers, vectors, host 
cells, polypeptides, antibodies and other compositions of the invention, for 
example in methods of synthesizing nucleic acid molecules (e.g., via amplification 
such as via PCR), in methods of cloning nucleic acid molecules (preferably via 
recombinational cloning as described herein), and the like. 

Optimization of Recombinational Cloning System 

The usefulness of a particular nucleic acid molecule, or vector comprising a 
nucleic acid molecule, of the invention in methods of recombinational cloning may 
be determined by any one of a number of assay methods. For example, Entry and 
Destination vectors of the present invention may be assessed for their ability to 
function (i.e., to mediate the transfer of a nucleic acid molecule, DNA segment, 
gene, cDNA molecule or library from a cloning vector to an Expression Vector) 
by carrying out a recombinational cloning reaction as described in more detail in 
the Examples below and as described in U.S. Application Nos. 08/663,002, filed 
June 7, 1996 (now U.S. Patent No. 5,888,732), 09/005,476, filed January 12, 
1998, 09/177,387, filed October 23, 1998, and 60/108,324, filed November 13, 
1998, the disclosures of which are incorporated by reference herein in their 
entireties. Alternatively, the functionality of Entry and Destination Vectors 
prepared according to the invention may be assessed by examining the ability of 
these vectors to recombine and create cointegrate molecules, or to transfer a 
nucleic acid molecule of interest, using an assay such as that described in detail 
below in Example 19. Analogously, the formulation of compositions comprising 
one or more recombination proteins or combinations thereof, for example 
GATEWAY™ LR Clonase™ Enzyme Mix and GATEWAY™ BP Clonase™ 
Enzyme Mix, may be optimized using assays such as those described below in 
Example 18. 

Uses 

There are a number of applications for the compositions, methods and kits of 
the present invention. These uses include, but are not limited to, changing vectors, 
targeting gene products to intracellular locations, cleaving fusion tags from desired 
proteins, operably linking nucleic acid molecules of interest to regulatory genetic 
sequences (e.g., promoters, enhancers, and the like), constructing genes for fusion 
proteins, changing copy number, changing replicons, cloning into phages, and 
cloning, e.g., PCR products, genomic DNAs, and cDNAs. In addition, the nucleic 
acid molecules, vectors, and host cells of the invention may be used in the 
production of polypeptides encoded by the nucleic acid molecules, in the 
production of antibodies directed against such polypeptides, in recombinational 
cloning of desired nucleic acid sequences, and in other applications that may be 
enhanced or facilitated by the use of the nucleic acid molecules, vectors, and host 
cells of the invention. 

In particular, the nucleic acid molecules, vectors, host cells, polypeptides, 
antibodies, and kits of the invention may be used in methods of transferring one 
or more desired nucleic acid molecules or DNA segments, for example one or 
more genes, cDNA molecules or cDNA libraries, into a cloning or Expression 
Vector for use in transforming additional host cells for use in cloning or 
amplification of, or expression of the polypeptide encoded by, the desired nucleic 
acid molecule or DNA segment. Such recombinational cloning methods, which 
may advantageously use the nucleic acid molecules, vectors, and host cells of the 
invention, are described in detail in the Examples below, and in commonly owned 
U.S. Application Nos. 08/486,139, filed June 7, 1995, 08/663,002, filed June 7, 
1996 (now U.S. Patent No. 5,888,732), 09/005,476, filed January 12, 1998, 
09/177,387, filed October 23, 1998, and 60/108,324, filed November 13, 1998, 
the disclosures of all of which are incorporated by reference herein in their 
entireties. 

It will be understood by one of ordinary skill in the relevant arts that other 
suitable modifications and adaptations to the methods and applications described 
herein are readily apparent from the description of the invention contained herein 
in view of information known to the ordinarily skilled artisan, and may be made 
without departing from the scope of the invention or any embodiment thereof. 
Having now described the present invention in detail, the same will be more clearly 
understood by reference to the following examples, which are included herewith 
for purposes of illustration only and are not intended to be limiting of the 
invention. 



Examples 

Example 1: Recombination Reactions of Bacteriophage λ 

The E. coli bacteriophage λ can grow as a lytic phage, in which case the host 
cell is lysed, with the release of progeny virus. Alternatively, lambda can integrate 
into the genome of its host by a process called lysogenization (see Figure 60). In 
this lysogenic state, the phage genome can be transmitted to daughter cells for 
many generations, until conditions arise that trigger its excision from the genome. 
At this point, the virus enters the lytic part of its life cycle. The control of the 
switch between the lytic and lysogenic pathways is one of the best understood 
processes in molecular biology (M. Ptashne, A Genetic Switch, Cell Press, 1992). 

The integrative and excisive recombination reactions of λ, performed in vitro, 
are the basis of the Recombinational Cloning System of the present invention. They 
can be represented schematically as follows: 

attB x attP ↔ attL x attR (where "x" signifies recombination) 

The four att sites contain binding sites for the proteins that mediate the 
reactions. The wild type attP, attB, attL, and attR sites contain about 243, 25, 
100, and 168 base pairs, respectively. The attB x attP reaction (hereinafter 
referred to as a "BP Reaction," or alternatively and equivalently as an "Entry 
Reaction" or a "Gateward Reaction") is mediated by the proteins Int and IHF. 
The attL x attR reaction (hereinafter referred to as an "LR Reaction," or 
alternatively and equivalently as a "Destination Reaction") is mediated by the 
proteins Int, IHF, and Xis. Int (integrase) and Xis (excisionase) are encoded by 
the λ genome, while IHF (integration host factor) is an E. coli protein. For a 
general review of lambda recombination, see A. Landy, Ann. Rev. Biochem. 58: 
913-949 (1989). 
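
The relationships described in this Example can be summarized in a short sketch. The Python below is an illustrative aid only and forms no part of the disclosed methods; the data layout and function names are assumptions made for the example, while the site lengths and protein requirements are those stated above. It also checks that the approximate wild type site lengths on the two sides of the reaction add up to the same number of base pairs.

```python
# Illustrative summary of the lambda att recombination reactions described above.
# The data layout and function names are assumptions of this sketch.

# Approximate wild type site lengths in base pairs, as stated in the text.
ATT_SITE_LENGTH_BP = {"attP": 243, "attB": 25, "attL": 100, "attR": 168}

# Substrates, products and mediating proteins for each reaction.
REACTIONS = {
    "BP": {
        "substrates": ("attB", "attP"),
        "products": ("attL", "attR"),
        "proteins": ("Int", "IHF"),
    },
    "LR": {
        "substrates": ("attL", "attR"),
        "products": ("attB", "attP"),
        "proteins": ("Int", "IHF", "Xis"),
    },
}


def total_site_length(sites):
    """Sum the approximate lengths (bp) of the named att sites."""
    return sum(ATT_SITE_LENGTH_BP[site] for site in sites)


def describe(reaction_name):
    """Return a one-line description of a reaction."""
    reaction = REACTIONS[reaction_name]
    substrates = " x ".join(reaction["substrates"])
    products = " + ".join(reaction["products"])
    proteins = ", ".join(reaction["proteins"])
    return f"{reaction_name} Reaction: {substrates} -> {products} (requires {proteins})"


for name in REACTIONS:
    print(describe(name))
```

With the lengths quoted above, attB plus attP and attL plus attR each total 268 bp.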

Example 2: Recombination Reactions of the Recombinational Cloning 
System 

The LR Reaction -- the exchange of a DNA segment from an Entry Clone to 
a Destination Vector -- is the in vitro version of the λ excision reaction: 

attL x attR ⇒ attB + attP. 



There is a practical imperative for this configuration: after an LR Reaction in 
one configuration of the present method, an att site usually separates a functional 
motif (such as a promoter or a fusion tag) from a nucleic acid molecule of interest 
in an Expression Clone, and the 25 bp attB site is much smaller than the attP, attL, 
and attR sites. 

Note that the recombination reaction is conservative, i.e., there is no net 
synthesis or loss of base pairs. The DNA segments that flank the recombination 
sites are merely switched. The wild type λ recombination sites are modified for 
purposes of the GATEWAY™ Cloning System, as follows: 

To create certain preferred Destination Vectors, a part (43 bp) of attR was 
removed, to make the excisive reaction irreversible and more efficient (W. 
Bushman et al., Science 230: 906, 1985). The attR sites in preferred Destination 
Vectors of the invention are 125 bp in length. Mutations were made to the core 
regions of the att sites, for two reasons: (1) to eliminate stop codons, and (2) to 
ensure specificity of the recombination reactions (i.e., attR1 reacts only with 
attL1, attR2 reacts only with attL2, etc.). 

Other mutations were introduced into the short (5 bp) regions flanking the 
15 bp core regions of the attB sites to minimize secondary structure formation in 
single-stranded forms of attB plasmids, e.g., in phagemid ssDNA or in mRNA. 
Sequences of attB1 and attB2 to the left and right of a nucleic acid molecule of 
interest after it has been cloned into a Destination Vector are given in Figure 6. 

Figure 61 illustrates how an Entry Clone and a Destination Vector recombine 
in the LR Reaction to form a co-integrate, which resolves through a second 
reaction into two daughter molecules. The two daughter molecules have the same 
general structure regardless of which pair of sites, attL1 and attR1 or attL2 and 
attR2, react first to form the co-integrate. The segments change partners by these 
reactions, regardless of whether the parental molecules are both circular, one is 
circular and one is linear, or both are linear. In this example, selection for 
ampicillin resistance carried on the Destination Vector, which also carries the 
death gene ccdB, provides the means for selecting only for the desired attB 
product plasmid. 
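
The partner exchange and the selection step described above can be sketched as follows. This is a simplified illustration under stated assumptions, not the disclosed method itself: the `Plasmid` class, the function names, the "attP1"/"attP2" labels for the by-product and the kanamycin marker on the Entry Clone backbone are all choices made for the example, and the co-integrate intermediate is not modeled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plasmid:
    name: str
    backbone_marker: str  # drug resistance carried on the vector backbone
    left_site: str
    cassette: str         # segment lying between the two att sites
    right_site: str


def lr_reaction(entry_clone, destination_vector):
    """Swap the segments flanked by attL1/attL2 and attR1/attR2.

    Returns (expression_clone, by_product). Only the two resolved
    daughter molecules are returned; the co-integrate is skipped.
    """
    if (entry_clone.left_site, entry_clone.right_site) != ("attL1", "attL2"):
        raise ValueError("Entry Clone must carry attL1 and attL2")
    if (destination_vector.left_site, destination_vector.right_site) != ("attR1", "attR2"):
        raise ValueError("Destination Vector must carry attR1 and attR2")
    expression_clone = Plasmid(
        "Expression Clone", destination_vector.backbone_marker,
        "attB1", entry_clone.cassette, "attB2",
    )
    by_product = Plasmid(
        "By-product", entry_clone.backbone_marker,
        "attP1", destination_vector.cassette, "attP2",
    )
    return expression_clone, by_product


def survives_ampicillin(plasmid):
    """A transformant grows only with an amp marker and without ccdB."""
    return plasmid.backbone_marker == "amp" and plasmid.cassette != "ccdB"


# Hypothetical starting molecules (the kan marker on the Entry Clone is assumed).
entry = Plasmid("Entry Clone", "kan", "attL1", "gene of interest", "attL2")
dest = Plasmid("Destination Vector", "amp", "attR1", "ccdB", "attR2")

expression_clone, by_product = lr_reaction(entry, dest)
candidates = (entry, dest, expression_clone, by_product)
print([p.name for p in candidates if survives_ampicillin(p)])  # prints ['Expression Clone']
```

Of the four molecules, only the attB product plasmid both carries the ampicillin marker and lacks ccdB, which is the selection logic described in the paragraph above.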

Example 3: Protein Expression in the Recombinational Cloning System 

Proteins are expressed in vivo as a result of two processes, transcription (DNA 
into RNA), and translation (RNA into protein). For a review of protein 
expression in prokaryotes and eukaryotes, see Example 13 below. Many vectors 
(pUC, BlueScript, pGem) use interruption of a transcribed lacZ gene for blue- 
white screening. These plasmids, and many Expression Vectors, use the lac 
promoter to control expression of cloned genes. Transcription from the lac 
promoter is turned on by adding the inducer IPTG. However, a low level of RNA 
is made in the absence of inducer, i.e., the lac promoter is never completely off. 
The result of this "leakiness" is that genes whose expression is harmful to E. coli 
may prove difficult or impossible to clone in vectors that contain the lac promoter, 
or they may be cloned only as inactive mutants. 

In contrast to other gene expression systems, nucleic acid molecules cloned 
into an Entry Vector may be designed not to be expressed. The presence of the 
strong transcriptional terminator rrnB (Orosz, et al., Eur. J. Biochem. 201: 653, 
1991) just upstream of the attL1 site keeps transcription from the vector 
promoters (drug resistance and replication origin) from reaching the cloned gene. 
However, if a toxic gene is cloned into a Destination Vector, the host may be sick, 
just as in other expression systems. But the reliability of subcloning by in vitro 
recombination makes it easier to recognize that this has happened — and easier to 
try another expression option in accordance with the methods of the invention, if 
necessary. 

Example 4: Choosing the Right Entry Vector 

There are two kinds of choices that must be made in choosing the best Entry 
Vector, dictated by (1) the particular DNA segment that is to be cloned, and (2) 
what is to be accomplished with the cloned DNA segment. These factors are 
critical in the choice of Entry Vector used, because when the desired nucleic acid 
molecule of interest is moved from the Entry Vector to a Destination Vector, all 
the base pairs between the nucleic acid molecule of interest and the Int cutting 
sites in attL1 and attL2 (such as in Figure 6) move into the Destination Vector 
as well. For genomic DNAs that are not expressed as a result of moving into a 
Destination Vector, these decisions are not as critical. 

For example, if an Entry Vector with certain translation start signals is used, 
those sequences will be translated into amino acids if an amino-terminal fusion to 
the desired nucleic acid molecule of interest is made. Whether the desired nucleic 
acid molecule of interest is to be expressed as fusion protein, native protein, or 
both, dictates whether translational start sequences must be included between the 
attB sites of the clone (native protein) or, alternatively, supplied by the Destination 
Vector (fusion protein). In particular, Entry Clones that include translational start 
sequences may prove less suitable for making fusion proteins, as internal initiation 
of translation at these sites can decrease the yield of N-terminal fusion protein. 
These two types of expression afforded by the compositions and methods of the 
invention are illustrated in Figure 62. 

No Entry Vector is likely to be optimal for all applications. The nucleic acid 
molecule of interest may be cloned into any of several suitable Entry Vectors. 

As an example, consider pENTR7 (Figure 16) and pENTR11 (Figure 20), 
which are useful in a variety of applications, including (but not limited to): 

• Cloning cDNAs from most of the commercially available libraries. The sites 
to the left and right of the ccdB death gene have been chosen so that 
directional cloning is possible if the DNA to be cloned does not have two or 
more of these restriction sites. 

• Cloning of genes directionally: SalI, BamHI, XmnI (blunt), or KpnI on the 
left of ccdB; NotI, XhoI, XbaI, or EcoRV (blunt), on the right. 

• Cloning of genes or gene fragments with a blunt amino end at the XmnI site. 
The XmnI site has four of the six most favored bases for eukaryotic expression 
(see Example 13, below), so that if the first three bases of the DNA to be 
cloned are ATG, the open reading frame (ORF) will be expressed in 
eukaryotic cells (e.g., mammalian cells, insect cells, yeast cells) when it is 
transcribed in the appropriate Destination Vector. In addition, in pENTR11, 
a Shine-Dalgarno sequence is situated 8 bp upstream, for initiating protein 
synthesis in a prokaryotic host cell (particularly a bacterial cell, such as E. coli) 
at an ATG. 

• Cleaving off amino terminal fusions (e.g., His6, GST, or thioredoxin) using 
the highly specific TEV (Tobacco Etch Virus) protease (available from Life 
Technologies, Inc.). If the nucleic acid molecule of interest is cloned at the 
blunt XmnI site, TEV cleavage will leave two amino acids on the amino end 
of the expressed protein. 

• Selecting against uncut or singly cut Entry Vector molecules during cloning 
with restriction enzymes and ligase. If the ccdB gene is not removed with a 
double digest, it will kill any recipient E. coli cell that does not contain a 
mutation that makes the cell resistant to ccdB (see U.S. Provisional 
Application No. 60/122,392, filed on March 2, 1999, the disclosure of which 
is incorporated by reference herein in its entirety). 

• Allowing production of amino fusions with ORFs in all cloning sites. There 
are no stop codons (in the attL1 reading frame) upstream of the ccdB gene. 

In addition, pENTR11 is also useful in the following applications: 

• Cloning cDNAs that have an NcoI site at the initiating ATG into the NcoI 
site. Similar to the XmnI site, this site has four of the six most favored bases 
for eukaryotic expression. Also, a Shine-Dalgarno sequence is situated 8 bp 
upstream, for initiating protein synthesis in a prokaryotic host cell (particularly 
a bacterial cell, such as E. coli) at an ATG. 

• Producing carboxy fusion proteins with ORFs positioned in phase with the 
reading frame convention for carboxy-terminal fusions (see Figure 20A). 
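
When choosing enzymes for directional cloning into these vectors, it can help to check which of the listed sites are absent from the DNA to be cloned. The sketch below is a simplified illustration: the function names and the example insert are hypothetical, the recognition sequences are the standard ones for the enzymes named above, and the rule applied (an enzyme is usable only if it does not cut the insert) is a stricter simplification of the criterion given in the text.

```python
import re

# Standard recognition sequences (5'->3'); N stands for any base.
# All of these sites are palindromic, so scanning one strand is sufficient.
LEFT_OF_CCDB = {"SalI": "GTCGAC", "BamHI": "GGATCC", "XmnI": "GAANNNNTTC", "KpnI": "GGTACC"}
RIGHT_OF_CCDB = {"NotI": "GCGGCCGC", "XhoI": "CTCGAG", "XbaI": "TCTAGA", "EcoRV": "GATATC"}


def count_sites(sequence, recognition_site):
    """Count (possibly overlapping) occurrences of a recognition site."""
    pattern = "(?=" + recognition_site.replace("N", "[ACGT]") + ")"
    return len(re.findall(pattern, sequence.upper()))


def enzymes_not_cutting(sequence, sites):
    """Return the enzymes whose recognition site is absent from the sequence."""
    return [name for name, site in sites.items() if count_sites(sequence, site) == 0]


# Hypothetical insert containing one BamHI site and one XhoI site.
insert = "ATGGGATCCAAACTCGAGTAA"
print("left-side candidates:", enzymes_not_cutting(insert, LEFT_OF_CCDB))
print("right-side candidates:", enzymes_not_cutting(insert, RIGHT_OF_CCDB))
```

One enzyme from each list can then be paired for a double digest of the insert and of the Entry Vector.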

Table 1 lists some non-limiting examples of Entry Vectors and their 
characteristics, and Figures 10-20 show their cloning sites. All of the Entry 
Vectors listed in Table 1 are available commercially from Life Technologies, Inc., 
Rockville, Maryland. Other Entry Vectors not specifically listed here, which 
comprise alternative or additional features may be made by one of ordinary skill 
using routine methods of molecular and cellular biology, in view of the disclosure 
contained herein. 

Entry vectors pENTR1A (Figures 10A and 10B), pENTR2B (Figures 11A 
and 11B), and pENTR3C (Figures 12A and 12B) are almost identical, except that 
the restriction sites are in different reading frames. Entry vectors pENTR4 
(Figures 13A and 13B), pENTR5 (Figures 14A and 14B), and pENTR6 
(Figures 15A and 15B) are essentially identical to pENTR1A, except that the blunt 
DraI site has been replaced with sites containing the ATG methionine codon: 
NcoI in pENTR4, NdeI in pENTR5, and SphI in pENTR6. Nucleic acid 
molecules that contain one of these sites at the initiating ATG can be conveniently 
cloned in these Entry vectors. The NcoI site in pENTR4 is especially useful for 
expression of nucleic acid molecules in eukaryotic cells, since it contains many of 
the bases that give efficient translation (see Example 13, below). (Nucleic acid 
molecules of interest cloned into the NdeI site of pENTR5 are not expected to be 
highly expressed in eukaryotic cells, because the cytosine at position -3 from the 
initiating ATG is rare in eukaryotic genes.) 

Entry vectors pENTR7 (Figures 16A and 16B), pENTR8 (Figures 17A 
and 17B), and pENTR9 (Figures 18A and 18B) contain the recognition site for 
the TEV protease between the attL1 site and the cloning sites. Cleavage sites for 
XmnI (blunt), NcoI, and NdeI, respectively, are the most 5' sites in these Entry 
vectors. Amino fusions can be removed efficiently if nucleic acid molecules are 
cloned into these Entry vectors. TEV protease is highly active and highly specific. 

Example 5: Controlling Reading Frame 

One of the trickiest tasks in expression of cloned nucleic acid molecules 
is making sure the reading frame is correct. (Reading frame is important if 
fusions are being made between two ORFs, for example between a nucleic acid 
molecule of interest and a His6 or GST domain.) For purposes of the present 
invention, the following convention has been adopted: The reading frame of the 
DNA cloned into any Entry Vector must be in phase with that of the attB1 site 
shown in Figure 16A, pENTR7. Notice that the six As of the attL1 site are split 
into two lysine codons (aaa aaa). The Destination Vectors that make amino 
fusions were constructed such that they enter the attR1 site in this reading frame. 
Destination Vectors for carboxy terminal fusions were also constructed, including 
those containing His6 (pDEST23; Figure 43), GST (pDEST24; Figure 44), or 
thioredoxin (pDEST25; Figure 45) C-terminal fusion sequences. 

Therefore, if a nucleic acid molecule of interest is cloned into an Entry 
Vector so that the aaa aaa reading frame within the attL1 site is in phase with the 
nucleic acid molecule's ORF, amino terminal fusions will automatically be 
correctly phased, for all the fusion tags. This is a significant improvement over the 
usual case, where each different vector can have different restriction sites and 
different reading frames. 

See Example 15 for a practical example of how to choose the most 
appropriate combinations of Entry Vector and Destination Vector. 
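
The reading frame convention of this Example amounts to a simple arithmetic check: the distance from the first A of the "aaa aaa" lysine codons to the first base of the ORF must be a multiple of three, with no stop codon in between. The following sketch illustrates that check. The function names and the two junction sequences are hypothetical and are not taken from any figure of this disclosure.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def is_in_phase(frame_start, orf_start):
    """True if the ORF start lies a whole number of codons after frame_start."""
    return (orf_start - frame_start) % 3 == 0


def in_frame_stops(sequence, frame_start, orf_start):
    """Positions of stop codons read in the frame_start frame, before the ORF."""
    sequence = sequence.upper()
    return [
        i for i in range(frame_start, orf_start, 3)
        if sequence[i:i + 3] in STOP_CODONS
    ]


# Hypothetical junctions. Each begins at the first A of "aaa aaa" (index 0)
# and runs into the start of an ORF.
phased = "AAAAAAGCAGGCTTCATGGCC"   # aaa aaa gca ggc ttc ATG gcc
shifted = "AAAAAAGCAGGCTATGGCC"    # ATG begins at index 13, out of phase

for label, junction in (("phased", phased), ("shifted", shifted)):
    atg = junction.find("ATG")
    print(label, "ATG at", atg,
          "in phase:", is_in_phase(0, atg),
          "stops before ORF:", in_frame_stops(junction, 0, atg))
```

In the first junction the ATG begins at index 15 and is in phase; in the second it begins at index 13 and an amino terminal fusion would be read out of frame.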



Materials 

Unless otherwise indicated, the following materials were used in the 
remaining Examples included herein: 

5X LR Reaction Buffer: 

200-250 mM (preferably 250 mM) Tris-HCl, pH 7.5 
250-350 mM (preferably 320 mM) NaCl 
1.25-5 mM (preferably 4.75 mM) EDTA 
12.5-35 mM (preferably 22-35 mM, and most preferably 35 mM) Spermidine-HCl 
1 mg/ml bovine serum albumin 

GATEWAY™ LR Clonase™ Enzyme Mix: 

per 4 µl of 1X LR Reaction Buffer: 

150 ng carboxy-His6-tagged Int (see U.S. Appl. Nos. 60/108,324, filed 
November 13, 1998, and 09/438,358, filed November 12, 
1999, both entirely incorporated by reference herein) 
25 ng carboxy-His6-tagged Xis (see U.S. Appl. Nos. 60/108,324, filed 
November 13, 1998, and 09/438,358, filed November 12, 
1999, both entirely incorporated by reference herein) 
30 ng IHF 
50% glycerol 

5X BP Reaction Buffer: 

125 mM Tris-HCl, pH 7.5 
110 mM NaCl 
25 mM EDTA 
25 mM Spermidine-HCl 
5 mg/ml bovine serum albumin 

GATEWAY™ BP Clonase™ Enzyme Mix: 

per 4 µl of 1X BP Reaction Buffer: 

200 ng carboxy-His6-tagged Int (see U.S. Appl. Nos. 60/108,324, filed 
November 13, 1998, and 09/438,358, filed November 12, 
1999, both entirely incorporated by reference herein) 
80 ng IHF 
50% glycerol 

10X Clonase Stop Solution: 

50 mM Tris-HCl, pH 8.0 
1 mM EDTA 
2 mg/ml Proteinase K 
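
As a worked illustration of the buffer compositions above, the sketch below computes the concentrations that 4 µl of a 5X buffer contributes to a 20 µl reaction. It is a convenience calculation only: the function name and dictionary layout are assumptions of the example, the preferred values listed above are used for the LR buffer, and any buffer components carried in with the enzyme mix are deliberately ignored.

```python
# Preferred 5X compositions as listed above (mM, except BSA in mg/ml).
LR_BUFFER_5X = {
    "Tris-HCl, pH 7.5 (mM)": 250.0,
    "NaCl (mM)": 320.0,
    "EDTA (mM)": 4.75,
    "Spermidine-HCl (mM)": 35.0,
    "BSA (mg/ml)": 1.0,
}
BP_BUFFER_5X = {
    "Tris-HCl, pH 7.5 (mM)": 125.0,
    "NaCl (mM)": 110.0,
    "EDTA (mM)": 25.0,
    "Spermidine-HCl (mM)": 25.0,
    "BSA (mg/ml)": 5.0,
}


def diluted(stock, stock_volume_ul, total_volume_ul):
    """Concentrations contributed by stock_volume_ul of stock in total_volume_ul."""
    if not 0 < stock_volume_ul <= total_volume_ul:
        raise ValueError("stock volume must be positive and no larger than the total")
    return {
        name: concentration * stock_volume_ul / total_volume_ul
        for name, concentration in stock.items()
    }


for label, stock in (("LR", LR_BUFFER_5X), ("BP", BP_BUFFER_5X)):
    print(label, diluted(stock, stock_volume_ul=4.0, total_volume_ul=20.0))
```

For example, 4 µl of the preferred 5X LR Reaction Buffer in 20 µl contributes about 50 mM Tris-HCl and 64 mM NaCl.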

Example 6: LR ("Destination") Reaction 

To create a new Expression Clone containing the nucleic acid molecule of 
interest (and which may be introduced into a host cell, ultimately for production 
of the polypeptide encoded by the nucleic acid molecule), an Entry Clone or 
Vector containing the nucleic acid molecule of interest, prepared as described 
herein, is reacted with a Destination Vector. In the present example, a β-Gal gene 
flanked by attL sites is transferred from an Entry Clone to a Destination Vector. 

Materials needed: 

• 5X LR Reaction Buffer 

• Destination Vector (preferably linearized), 75-150 ng/µl 

• Entry Clone containing nucleic acid molecule of interest, 100-300 ng in < 8 µl 
TE buffer 

• Positive control Entry Clone (pENTR-β-Gal) DNA (See note, below) 

• Positive control Destination Vector, pDEST1 (pTrc), 75 ng/µl 

• GATEWAY™ LR Clonase™ Enzyme Mix (stored at -80°C) 

• 10X Clonase Stop Solution 

• pUC19 DNA, 10 pg/µl 

• Chemically competent E. coli cells (competence: > 1 x 10^7 CFU/µg), 400 µl 

• LB plates containing ampicillin (100 µg/ml) and methicillin (200 µg/ml) ± 
X-gal and IPTG (See below) 

Notes: 

Preparation of the Entry Clone DNA: Miniprep DNA that has been treated 
with RNase works well. A reasonably accurate quantitation (±50%) of the DNA 
to be cloned is advised, as the GATEWAY™ reaction appears to have an 
optimum of about 100-300 ng of Entry Clone per 20 µl of reaction mix. 

The positive control Entry Clone, pENTR-β-Gal, permits functional analysis 
of clones based on the numbers of expected blue vs. white colonies on LB plates 
containing IPTG + Bluo-gal (or X-gal), in addition to ampicillin (100 µg/ml) and 
methicillin (200 µg/ml). Because β-Galactosidase is a large protein, it often yields 
a less prominent band than many smaller proteins do on SDS protein gels. 

In the Positive Control Entry Vector pENTR-β-Gal, the coding sequence 
of β-Gal has been cloned into pENTR11 (Figures 20A and 20B), with 
translational start signals permitting expression in E. coli, as well as in eukaryotic 
cells. The positive control Destination Vector, for example pDEST1 (Figure 21), 
is preferably linearized. 

To prepare X-gal + IPTG plates, either of the following protocols may be 
used: 

A. With a glass rod, spread over the surface of an LB agar plate: 40 µl of 
20 mg/ml X-gal (or Bluo-gal) in DMF plus 4 µl of 200 mg/ml IPTG. Allow liquid 
to adsorb into agar for 3-4 hours at 37°C before plating cells. 

B. To liquid LB agar at ~45 °C, add: X-gal (or Bluo-Gal) (20 mg/ml in DMF) 
to make 50 µg/ml and IPTG (200 mM in water) to make 0.5-1 mM, just prior to 
pouring plates. Store X-gal and Bluo-Gal in a light-shielded container. 

Colony color may be enhanced by placing the plates at 5 °C for a few hours 
after the overnight incubation at 37 °C. Protocol B can give more consistent 
colony color than A, but A is more convenient when selection plates are needed 
on short notice. 

Recombination in Clonase reactions continues for many hours. While 
incubations of 45-60 minutes are usually sufficient, reactions with large DNAs, or 
in which both parental DNAs are supercoiled, or which will be transformed into 
cells of low competence, can be improved with longer incubation times, such as 
2-24 hours at 25 °C. 

Procedure: 

1. Assemble reactions as follows (combine all components at room temperature, 
except GATEWAY™ LR Clonase™ Enzyme Mix ("Clonase LR"), before 
removing Clonase LR from frozen storage): 

                                      Tube 1    Tube 2    Tube 3    Tube 4
Component                             Neg.      Pos.      Neg.      Test

p-Gate-βGal (Positive control         4 µl      4 µl      --        --
  Entry Clone), 73 ng/µl
pDEST1 (Positive control              4 µl      4 µl      --        --
  Destination Vector), 75 ng/µl
Your Entry Clone (100-300 ng)         --        --        1 - 8 µl  1 - 8 µl
Destination Vector for your nucleic   --        --        4 µl      4 µl
  acid molecule, 75 ng/µl
5X LR Reaction Buffer                 4 µl      4 µl      4 µl      4 µl
TE                                    8 µl      4 µl      To 20 µl  To 16 µl
GATEWAY™ LR Clonase™ Enzyme Mix       --        4 µl      --        4 µl
  (store at -80°C, add last)
Total Volume                          20 µl     20 µl     20 µl     20 µl

2. Remove the GATEWAY™ LR Clonase™ Enzyme Mix from the -80°C 
freezer, place immediately on ice. The Clonase takes only a few minutes to thaw. 

3. Add 4 µl of GATEWAY™ LR Clonase™ Enzyme Mix to reactions #2 and #4. 

4. Return GATEWAY™ LR Clonase™ Enzyme Mix to -80°C freezer. 

5. Incubate tubes at 25°C for at least 60 minutes. 

6. Add 2 µl Clonase Stop solution to all reactions. Incubate for 20 min at 37°C. 
(This step usually increases the total number of colonies obtained by 10-20 fold.) 

7. Transform 2 µl into 100 µl competent E. coli. Select on plates containing 
ampicillin at 100 µg/ml. 
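
The volumes for the test reaction (Tube 4) in the procedure above follow from the concentration of the Entry Clone preparation. The sketch below illustrates the arithmetic. The function name, its parameters and the 200 ng default target are assumptions of the example; the fixed volumes and the 100-300 ng and 1-8 µl limits are those given in the procedure.

```python
def assemble_lr_test_reaction(entry_ng_per_ul, entry_target_ng=200.0):
    """Return component volumes (µl) for a 20 µl LR test reaction.

    Fixed volumes follow the procedure above: 4 µl Destination Vector,
    4 µl 5X LR Reaction Buffer and 4 µl Clonase, with TE making up the rest.
    """
    if not 100.0 <= entry_target_ng <= 300.0:
        raise ValueError("the protocol suggests 100-300 ng of Entry Clone")
    entry_ul = entry_target_ng / entry_ng_per_ul
    if not 1.0 <= entry_ul <= 8.0:
        raise ValueError("Entry Clone volume should fall between 1 and 8 µl")
    volumes = {
        "Entry Clone": entry_ul,
        "Destination Vector": 4.0,
        "5X LR Reaction Buffer": 4.0,
        "LR Clonase Enzyme Mix (add last)": 4.0,
    }
    volumes["TE"] = 20.0 - sum(volumes.values())
    return volumes


# Hypothetical miniprep at 50 ng/µl: 4 µl of Entry Clone and 4 µl of TE.
for component, volume in assemble_lr_test_reaction(50.0).items():
    print(f"{component}: {volume:.1f} µl")
```

A preparation that is too dilute to deliver the target amount in 8 µl, or too concentrated to pipette at least 1 µl, raises an error rather than returning a reaction outside the stated ranges.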



Example 7: Transformation of E. coli 

To introduce cloning or Expression Vectors prepared using the 
recombinational cloning system of the invention, any standard E. coli 
transformation protocol should be satisfactory. The following steps are 
recommended for best results: 

1. Let the mixture of competent cells and Recombinational Cloning System 
reaction product stand on ice at least 15 minutes prior to the heat-shock step. 
This gives time for the recombination proteins to dissociate from the DNA, 
and improves the transformation efficiency. 

2. Expect the reaction to be about 1%-5% efficient, i.e., 2 µl of the reaction 
should contain at least 100 pg of the Expression Clone plasmid (taking into 
account the amounts of each parental plasmid in the reaction, and the 
subsequent dilution). If the E. coli cells have a competence of 10^7 CFU/µg, 
100 pg of the desired clone plasmid will give about 1000 colonies, or more, 
if the entire transformation is spread on one ampicillin plate. 

3. Always do a control pUC DNA transformation. If the number of colonies 
is not what you expect, the pUC DNA transformation gives you an indication 
of where the problem was. 
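
The colony estimate in step 2 is a unit conversion, sketched below for convenience. The function name and the `fraction_plated` parameter are assumptions of the example; the figures of 100 pg and 10^7 CFU/µg are those used in the text.

```python
def expected_colonies(plasmid_pg, competence_cfu_per_ug, fraction_plated=1.0):
    """Estimate colonies from plasmid mass (pg) and cell competence (CFU/µg)."""
    if plasmid_pg < 0 or competence_cfu_per_ug < 0:
        raise ValueError("mass and competence must be non-negative")
    if not 0.0 <= fraction_plated <= 1.0:
        raise ValueError("fraction_plated must lie between 0 and 1")
    # 1 µg = 1,000,000 pg
    return plasmid_pg * competence_cfu_per_ug / 1e6 * fraction_plated


# 100 pg of Expression Clone plasmid into cells of 10^7 CFU/µg, all plated.
print(round(expected_colonies(100.0, 1e7)))  # prints 1000
```

The same function can be applied to the pUC control transformation to judge whether a low colony count reflects the cells or the recombination reaction.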

Example 8: Preparation of attB-PCR Product 

For preparation of attB-PCR products in the PCR cloning methods 
described in Example 9 below, PCR primers containing attB1 and attB2 sequences 
are used. The attB1 and attB2 primer sequences are as follows: 

attB1: 5'-GGGGACAAGTTTGTACAAAAAAGCAGGCT-(template-specific 
sequence)-3' 

attB2: 5'-GGGGACCACTTTGTACAAGAAAGCTGGGT-(template-specific 
sequence)-3' 

The attB1 sequence should be added to the amino primer, and the attB2 sequence 
to the carboxy primer. The 4 guanines at the 5' ends of each of these primers 
enhance the efficiency of the minimal 25 bp attB sequences as substrates for use 
in the cloning methods of the invention. 
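
Assembling the two primers from the sequences above is mechanical and can be sketched as follows. This is an illustration under stated assumptions: the function names and the example template sequences are hypothetical, both template regions are assumed to be supplied as coding-strand sequence read 5' to 3', and no spacer bases for reading frame adjustment are added.

```python
# attB primer prefixes as given above (four guanines plus the 25 bp attB site).
ATTB1_PREFIX = "GGGGACAAGTTTGTACAAAAAAGCAGGCT"
ATTB2_PREFIX = "GGGGACCACTTTGTACAAGAAAGCTGGGT"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return sequence.upper().translate(_COMPLEMENT)[::-1]


def attb_primers(amino_end, carboxy_end):
    """Build the amino (attB1) and carboxy (attB2) primers.

    Both arguments are coding-strand sequences, 5'->3', taken from the
    two ends of the region to be amplified. The carboxy primer anneals
    to the coding strand, so its template-specific part is the reverse
    complement of carboxy_end.
    """
    forward = ATTB1_PREFIX + amino_end.upper()
    reverse = ATTB2_PREFIX + reverse_complement(carboxy_end)
    return forward, reverse


# Hypothetical template-specific regions.
forward, reverse = attb_primers("ATGGCTAGCAAAGGAGAAG", "GGCATGGATGAACTATACAAA")
print("attB1 primer: 5'-" + forward + "-3'")
print("attB2 primer: 5'-" + reverse + "-3'")
```

Each prefix is 29 nucleotides long, so the full primers are correspondingly longer than ordinary template-specific primers.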

Standard PCR conditions may be used to prepare the PCR product. The 
following suggested protocol employs Platinum Taq DNA Polymerase High 
Fidelity®, available commercially from Life Technologies, Inc. (Rockville, MD). 
This enzyme mix eliminates the need for hot starts, has improved fidelity over Taq, 
and permits synthesis of a wide range of amplicon sizes, from 200 bp to 10 kb, or 
more, even on genomic templates. 



Materials needed: 



• Platinum Taq DNA Polymerase High Fidelity® (Life Technologies, Inc.) 
• attB1- and attB2-containing primer pair (see above) specific for your 
template 
• DNA template (linearized plasmid or genomic DNA) 
• 10X High Fidelity PCR Buffer 
• 10 mM dNTP mix 
• PEG/MgCl2 Mix (30% PEG 8000, 30 mM MgCl2) 



Procedure: 



1.) Assemble the reaction as follows: 

Component                         Reaction with Plasmid Target  Reaction with Genomic Target

10X High Fidelity PCR Buffer      5 µl                          5 µl
dNTP Mix, 10 mM                   1 µl                          1 µl
MgSO4, 50 mM                      2 µl                          2 µl
attB1 Primer, 10 µM               2 µl                          1 µl
attB2 Primer, 10 µM               2 µl                          1 µl
Template DNA                      1-5 ng*                       >100 ng
PLATINUM Taq High Fidelity        2 µl                          1 µl
Water                             to 50 µl                      to 50 µl

* Use of higher amounts of plasmid template may permit fewer cycles (10-15) of 
PCR. 

2.) Add 2 drops mineral oil, as appropriate. 

3.) Denature for 30 sec at 94 °C. 

4.) Perform 25 cycles: 

94 °C for 15 sec-30 sec 
55 °C for 15 sec-30 sec 
68 °C for 1 min per kb of template. 

5.) Following the PCR reaction, apply 1-2 µl of the reaction mixture to an 
agarose gel, together with size standards (e.g., 1 Kb Plus Ladder, Life 
Technologies, Inc.) and quantitation standards (e.g., Low Mass Ladder, Life 
Technologies, Inc.), to assess the yield and uniformity of the product. 

Purification of the PCR product is recommended, to remove attB primer 
dimers which can clone efficiently into the Entry Vector. The following protocol 
is fast and will remove DNA <300 bp in size: 

6.) Dilute the 50 µl PCR reaction to 200 µl with TE. 

7.) Add 100 µl PEG/MgCl2 Solution. Mix and centrifuge immediately at 13,000 
RPM for 10 min at room temperature. Remove the supernatant (pellet is clear 
and hard to see). 

8.) Dissolve the pellet in 50 µl TE and check recovery on a gel. 
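
The cycling conditions of steps 3 and 4 above can be written out as a small program description, which makes the dependence of the extension time on product length explicit. The sketch below is illustrative only: the function names and the tuple layout are assumptions of the example, and thermal cycler ramp times are not included in the time estimate.

```python
def pcr_program(amplicon_kb, cycles=25, step_seconds=30):
    """Return the cycling program as (step, temperature °C, seconds, repeats).

    Follows the conditions above: 94 °C for 30 sec, then cycles of
    94 °C and 55 °C for 15-30 sec each and 68 °C for 1 min per kb.
    """
    if amplicon_kb <= 0:
        raise ValueError("amplicon length must be positive")
    if not 15 <= step_seconds <= 30:
        raise ValueError("denaturation and annealing steps are 15-30 sec")
    extension_seconds = round(60 * amplicon_kb)
    return [
        ("initial denaturation", 94, 30, 1),
        ("denaturation", 94, step_seconds, cycles),
        ("annealing", 55, step_seconds, cycles),
        ("extension", 68, extension_seconds, cycles),
    ]


def total_seconds(program):
    """Total programmed hold time, ignoring ramping between temperatures."""
    return sum(seconds * repeats for _, _, seconds, repeats in program)


program = pcr_program(amplicon_kb=2.0)
for step, temperature, seconds, repeats in program:
    print(f"{step}: {temperature} °C for {seconds} sec x {repeats}")
print("programmed time (min):", total_seconds(program) / 60)
```

For a 2 kb product with 30 sec denaturation and annealing steps, the programmed hold time is 4530 sec, or about 75 minutes.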

If the starting PCR template is a plasmid that contains the gene for Kan^r, 
it is advisable to treat the completed PCR reaction with the restriction enzyme 
DpnI to degrade the plasmid, since unreacted residual starting plasmid is a 
potential source of false-positive colonies from the transformation of the 
GATEWAY™ Cloning System reaction. Adding ~5 units of DpnI to the 
completed PCR reaction and incubating for 15 min at 37°C will eliminate this 
potential problem. Heat inactivate the DpnI at 65 °C for 15 min, prior to using the 
PCR product in the GATEWAY™ Cloning System reaction. 

Example 9: Cloning attB-PCR products into Entry Vectors via the BP 
("Gateward") Reaction 

The addition of 5'-terminal attB sequences to PCR primers allows synthesis 
of a PCR product that is an efficient substrate for recombination with a Donor 
(attP) Plasmid in the presence of GATEWAY™ BP Clonase™ Enzyme Mix. This 
reaction produces an Entry Clone of the PCR product (See Figure 8). 

The conditions of the Gateward Cloning reaction with an attB PCR 
substrate are similar to those of the BP Reaction (see Example 10 below), except 
that the attB-PCR product (see Example 8) substitutes for the Expression Clone, 
and the attB-PCR positive control (attB-tet^r) substitutes for the Expression Clone 
Positive Control (GFP). 

Materials needed: 

• 5X BP Reaction Buffer 

• Desired attB-PCR product DNA, 50-100 ng in < 8 µl TE 

• Donor (attP) Plasmid (Figures 49-54), 75 ng/µl, supercoiled DNA 

• attB-tet^r PCR product positive control, 25 ng/µl 

• GATEWAY™ BP Clonase™ Enzyme Mix (stored at -80°C) 

• 10X Clonase Stop Solution 

• pUC19 DNA, 10 pg/µl 

• Chemically competent E. coli cells (competence: > 1 x 10^7 CFU/µg), 400 µl 

Notes: 

• Preparation of attB-PCR DNA: see Example 8. 

• The Positive Control attB-tet^r PCR product contains a functional copy of the 
tet^r gene of pBR322, with its own promoter. By plating the transformation 
of the control BP Reaction on kanamycin (50 µg/ml) plates (if kan^r Donor 
Plasmids are used; see Figures 49-52) or an alternative selection agent (e.g., 
gentamycin, if gen^r Donor Plasmids are used; see Figure 54), and then picking 
about 50 of these colonies onto plates with tetracycline (20 µg/ml), the 
percentage of Entry Clones containing functional tet^r among the colonies from 
the positive control reaction can be determined (% Expression Clones = 
(number of tet^r + kan^r (or gen^r) colonies)/(kan^r (or gen^r) colonies)). 
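
The percentage described in the note above is a simple ratio of colony counts, sketched below. The function name and the example counts are hypothetical.

```python
def percent_tet_resistant(tet_resistant_count, colonies_picked):
    """Percentage of picked kan^r (or gen^r) colonies that are also tet^r."""
    if colonies_picked <= 0:
        raise ValueError("at least one colony must be picked")
    if not 0 <= tet_resistant_count <= colonies_picked:
        raise ValueError("tet^r colonies cannot exceed the colonies picked")
    return 100.0 * tet_resistant_count / colonies_picked


# Hypothetical result: 47 of 50 picked colonies grow on tetracycline.
print(percent_tet_resistant(47, 50))  # prints 94.0
```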



Procedure: 

1. Assemble reactions as follows. Combine all components except GATEWAY™ 
BP Clonase™ Enzyme Mix, before removing GATEWAY™ BP Clonase™ 
Enzyme Mix from frozen storage. 

                                      Neg.      Pos.      Test
Component                             Tube 1    Tube 2    Tube 3

attB-PCR product, 50-100 ng           --        --        1 - 8 µl
Donor (attP) Plasmid, 75 ng/µl        2 µl      2 µl      2 µl
attB-PCR tet^r control DNA            --        4 µl      --
  (75 ng/µl)
5X BP Reaction Buffer                 4 µl      4 µl      4 µl
TE                                    10 µl     6 µl      To 16 µl
GATEWAY™ BP Clonase™ Enzyme Mix       4 µl      4 µl      4 µl
  (store at -80°C, add last)
Total Volume                          20 µl     20 µl     20 µl

2. Remove the GATEWAY™ BP Clonase™ Enzyme Mix from the -80°C 
freezer, place immediately on ice. The Clonase takes only a few minutes to thaw. 

3. Add 4 µl of GATEWAY™ BP Clonase™ Enzyme Mix to the subcloning 
reaction, mix. 

4. Return GATEWAY™ BP Clonase™ Enzyme Mix to -80°C freezer. 

5. Incubate tubes at 25°C for at least 60 minutes. 

6. Add 2 µl Proteinase K (2 µg/µl) to all reactions. Incubate for 20 min at 37°C. 

7. Transform 2 µl into 100 µl competent E. coli, as per 3.2, above. Select on LB 
plates containing kanamycin, 50 µg/ml. 



Results: 

In initial experiments, primers for amplifying tetR and ampR from pBR322 
were constructed containing only the tetR- or ampR-specific targeting sequences, 
the targeting sequences plus attB1 (for forward primers) or attB2 (for reverse 
primers) sequences shown in Figure 9, or the attB1 or attB2 sequences with a 5' 
tail of four guanines. The construction of these primers is depicted in Figure 65. 
After PCR amplification of tetR and ampR from pBR322 using these primers and 
cloning the PCR products into host cells using the recombinational cloning system 
of the invention, the results shown in Figure 66 were obtained. These results 
demonstrated that primers containing attB sequences provided for a somewhat 
higher number of colonies on the tetracycline and ampicillin plates. However, 
inclusion of the 5' extensions of four or five guanines on the primers in addition 
to the attB sequences provided significantly better cloning results, as shown in 
Figures 66 and 67. These results indicate that the optimal primers for cloning of 
PCR products using recombinational cloning will contain the recombination site 
sequences with a 5' extension of four or five guanine bases. 
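
The primer layout described above (5' guanine extension, then the attB site, then the gene-specific targeting sequence) can be sketched as follows. This is a hedged illustration only: the function name is hypothetical, the attB1 and attB2 strings are the commonly published 25-base adapter sequences and are assumed here rather than taken from Figure 9 (which should be treated as authoritative), and the gene-specific sequences are invented placeholders.

```python
# Commonly published attB adapter sequences (an assumption; check Figure 9).
ATTB1 = "ACAAGTTTGTACAAAAAAGCAGGCT"  # used on forward primers
ATTB2 = "ACCACTTTGTACAAGAAAGCTGGGT"  # used on reverse primers


def attb_primer(att_site, gene_specific, n_guanines=4):
    """Build a primer as 5'-(G)n + att site + gene-specific sequence-3'."""
    if n_guanines not in (4, 5):
        raise ValueError("the text recommends four or five 5' guanines")
    return "G" * n_guanines + att_site + gene_specific.upper()


# Placeholder targeting sequences (hypothetical, not real tetR primers).
forward = attb_primer(ATTB1, "atgaaatctaacaatgcg")
reverse = attb_primer(ATTB2, "tcaggtcgaggtggcccg", n_guanines=5)
print(forward)
print(reverse)
```

Keeping the extension length as a parameter makes it easy to compare four-guanine and five-guanine versions of the same primer.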

To determine the optimal stoichiometry between attB-containing PCR 
products and attP-containing Donor plasmid, experiments were conducted where 
the amount of PCR product and Donor plasmid were varied during the BP 
Reaction. Reaction mixtures were then transformed into host cells and plated on 
tetracycline plates as above. Results are shown in Figure 68. These results 
indicate that, for optimal recombinational cloning results with a PCR product in 
the size range of the tet gene, the optimal amounts of attP-containing Donor plasmid are 
between about 100-500 ng (most preferably about 200-300 ng), while the optimal 
amounts of attB-containing PCR product are about 25-100 ng (most 
preferably about 100 ng), per 20 µl reaction. 
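
Because the amounts above are given by weight, it can help to convert them to moles when thinking about stoichiometry. The sketch below does this under stated assumptions: it uses the common approximation of 660 g/mol per base pair of double-stranded DNA, the function names are hypothetical, and the fragment and plasmid lengths in the example are invented placeholders rather than values from this document.

```python
AVG_BP_MASS = 660.0  # g/mol per base pair of dsDNA (a common approximation)


def ng_to_fmol(mass_ng, length_bp):
    """Convert a mass of double-stranded DNA (ng) to femtomoles."""
    return mass_ng * 1e6 / (length_bp * AVG_BP_MASS)


def molar_ratio(pcr_ng, pcr_bp, donor_ng, donor_bp):
    """Molar ratio of attB PCR product to attP Donor plasmid."""
    return ng_to_fmol(pcr_ng, pcr_bp) / ng_to_fmol(donor_ng, donor_bp)


# Hypothetical sizes: a 1400 bp PCR product and a 5000 bp Donor plasmid.
print(round(ng_to_fmol(100, 1400), 1), "fmol PCR product")
print(round(ng_to_fmol(300, 5000), 1), "fmol Donor plasmid")
print(round(molar_ratio(100, 1400, 300, 5000), 2), "PCR product : Donor")
```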

Experiments were then conducted to examine the effect of PCR product size 
on efficiency of cloning via the recombinational cloning approach of the invention. 
PCR products containing attB1 and attB2 sites, at sizes 256 bp, 1 kb, 1.4 kb, 
3.4 kb, 4.6 kb, 6.9 kb and 10.1 kb were prepared and cloned into Entry vectors 
as described above, and host cells were transformed with the Entry vectors 
containing the cloned PCR products. For each PCR product, cloning efficiency 
was calculated relative to cloning of pUC19 positive control plasmids as follows: 



$$
\text{Cloning Efficiency} = \frac{\text{CFU/ng attB PCR product}}{\text{CFU/ng pUC19 control}} \times \frac{\text{Size (kb) PCR product}}{\text{Size (kb) pUC19 control}}
$$
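
As a worked illustration of this relative efficiency calculation, the short function below multiplies the ratio of colony yields by the ratio of sizes. The function and parameter names are hypothetical, the example colony counts are invented, and the default control size assumes the standard 2.686 kb length of pUC19.

```python
def cloning_efficiency(cfu_per_ng_pcr, cfu_per_ng_control,
                       size_kb_pcr, size_kb_control=2.686):
    """Cloning efficiency of an attB PCR product relative to the pUC19 control.

    The size ratio corrects for the fact that one ng of a longer molecule
    contains fewer molecules than one ng of a shorter one.
    """
    if cfu_per_ng_control <= 0 or size_kb_control <= 0:
        raise ValueError("control values must be positive")
    return (cfu_per_ng_pcr / cfu_per_ng_control) * (size_kb_pcr / size_kb_control)


# Invented example values, for illustration only.
print(cloning_efficiency(cfu_per_ng_pcr=200, cfu_per_ng_control=1000,
                         size_kb_pcr=1.0, size_kb_control=2.0))
```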



The results of these experiments are depicted in Figures 69A-69C (for 256 bp 
PCR fragments), 70A-70C (for 1 kb PCR fragments), 71A-71C (for 1.4 kb PCR 
fragments), 72A-72C (for 3.4 kb PCR fragments), 73A-73C (for 4.6 kb PCR 
fragments), 74 (for 6.9 kb PCR fragments), and 75-76 (for 10.1 kb PCR 
fragments). The results shown in these figures are summarized in Figure 77, for 
different weights and moles of input PCR DNA. 

Together, these results demonstrate that attB-containing PCR products 
ranging in size from about 0.25 kb to about 5 kb clone relatively efficiently in the 
recombinational cloning system of the invention. While PCR products larger than 
about 5 kb clone less efficiently (apparently due to slow resolution of 
cointegrates), longer incubation times during the recombination reaction appear 
to improve the efficiency of cloning of these larger PCR fragments. Alternatively, 
it may also be possible to improve efficiency of cloning of large (> about 5 kb) 
PCR fragments by using lower levels of input attP Donor plasmid and perhaps 
attB-containing PCR product, and/or by adjusting reaction conditions (e.g., buffer 
conditions) to favor more rapid resolution of the cointegrates. 



Example 10: The BP Reaction 

One purpose of the Gateward ("Entry") reaction is to convert an Expression 
Clone into an Entry Clone. This is useful when you have isolated an individual 
Expression Clone from an Expression Clone cDNA library, and you wish to 
transfer the nucleic acid molecule of interest into another Expression Vector, or 
to move a population of molecules from an attB or attL library. Alternatively, 
you may have mutated an Expression Clone and now wish to transfer the mutated 
nucleic acid molecule of interest into one or more new Expression Vectors. In 
both cases, it is necessary first to convert the nucleic acid molecule of interest to 
an Entry Clone. 



Materials needed: 

• 5X BP Reaction Buffer 

• Expression Clone DNA, 100-300 ng in < 8 µl TE. 

• Donor (attP) Vector, 75 ng/µl, supercoiled DNA 

• Positive control attB-tet-PCR DNA, 25 ng/µl 

• GATEWAY™ BP Clonase™ Enzyme Mix (stored at -80°C) 

• Clonase Stop Solution (Proteinase K, 2 µg/µl). 

Notes: 

Preparation of the Expression Clone DNA: Miniprep DNA treated with RNase 
works well. 

1. As with the LR Reaction (see Example 14), the BP Reaction is strongly 
influenced by the topology of the reacting DNAs. In general, the reaction is 
most efficient when one of the DNAs is linear and the other is supercoiled, 
compared to reactions where the DNAs are both linear or both supercoiled. 
Further, linearizing the attB Expression Clone (anywhere within the vector) 
will usually give more colonies than linearizing the Donor (attP) Plasmid. If 
finding a suitable cleavage site within your Expression Clone vector proves 
difficult, you may linearize the Donor (attP) Plasmid between the attP1 and 
attP2 sites (for example, at the NcoI site), avoiding the ccdB gene. Maps of 
Donor (attP) Plasmids are given in Figures 49-54. 



Procedure: 

1. Assemble reactions as follows. Combine all components at room temperature, 
except GATEWAY™ BP Clonase™ Enzyme Mix, before removing 
GATEWAY™ BP Clonase™ Enzyme Mix from freezer. 

                                           Neg.      Pos.      Test
Component                                  Tube 1    Tube 2    Tube 3
Positive Control, attB-tet-PCR DNA,
  25 ng/µl                                 4 µl      4 µl      -
Desired attB Expression Clone DNA
  (100 ng), linearized                     -         -         1-8 µl
Donor (attP) Plasmid, 75 ng/µl             2 µl      2 µl      2 µl
5X BP Reaction Buffer                      4 µl      4 µl      4 µl
TE                                         10 µl     6 µl      To 16 µl
GATEWAY™ BP Clonase™ Enzyme Mix
  (store at -80°C, add last)               -         4 µl      4 µl
Total Volume                               20 µl     20 µl     20 µl

2. Remove the GATEWAY™ BP Clonase™ Enzyme Mix from the -80°C 
freezer, place immediately on ice. The mixture takes only a few minutes to thaw. 

3. Add 4 µl of GATEWAY™ BP Clonase™ Enzyme Mix to the subcloning 
reaction, mix. 

4. Return GATEWAY™ BP Clonase™ Enzyme Mix to the -80°C freezer. 

5. Incubate tubes at 25°C for at least 60 minutes. If both the attB and attP DNAs 
are supercoiled, incubation for 2-24 hours at 25°C is recommended. 

6. Add 2 µl Clonase Stop Solution. Incubate for 10 min at 37°C. 

7. Transform 2 µl into 100 µl competent E. coli, as above. Select on LB plates 
containing 50 µg/ml kanamycin. 

Example 11: Cloning PCR Products into Entry Vectors using Standard 
Cloning Methods 

Preparation of Entry Vectors for Cloning of PCR Products 

All of the Entry Vectors of the invention contain the death gene ccdB as a 
stuffer between the "left" and "right" restriction sites. The advantage of this 
arrangement is that there is virtually no background from vector that has not been 
cut with both restriction enzymes, because the presence of the ccdB gene will kill 
all standard E. coli strains. Thus it is necessary to cut each Entry Vector twice, 
to remove the ccdB fragment. 

We strongly recommend that, after digestion of the Entry Vector with the 
second restriction enzyme, you treat the reaction with phosphatase (calf intestine 
alkaline phosphatase, CIAP or thermosensitive alkaline phosphatase, TSAP). The 
phosphatase can be added directly to the reaction mixture, incubated for an 
additional time, and inactivated. This step dephosphorylates both the vector and 
ccdB fragments, so that during subsequent ligation there is less competition 
between the ccdB fragment and the DNA of interest for the termini of the Entry 
Vector. 

Blunt Cloning of PCR products 

Generally PCR products do not have 5' phosphates (because the primers are 
usually 5' OH), and they are not necessarily blunt. (On this latter point, see 
Brownstein, et al., BioTechniques 20:1006, 1996 for a discussion of how the 
sequence of the primers affects the addition of single 3' bases.) The following 
protocol repairs these two defects. 

In a 0.5 ml tube, ethanol precipitate about 40 ng of PCR product (as judged 
from an agarose gel). 

1. Dissolve the precipitated DNA in 10 µl comprising 1 µl 10 mM rATP, 1 µl 
mixed 2 mM dNTPs (i.e., 2 mM each dATP, dCTP, dTTP, and dGTP), 2 µl 
5x T4 polynucleotide kinase buffer (350 mM Tris-HCl (pH 7.6), 50 mM 
MgCl2, 500 mM KCl, 5 mM 2-mercaptoethanol), 10 units T4 polynucleotide 
kinase, 1 µl T4 DNA polymerase, and water to 10 µl. 

2. Incubate the tube at 37°C for 10 minutes, then at 65°C for 15 minutes, cool, 
and centrifuge briefly to bring any condensate to the tip of the tube. 

3. Add 5 µl of the PEG/MgCl2 solution, mix and centrifuge at room temperature 
for 10 minutes. Discard supernatant. 

4. Dissolve the invisible precipitate in 10 µl containing 2 µl 5x T4 DNA ligase 
buffer (Life Technologies, Inc.), 0.5 units T4 DNA ligase, and about 50 ng of 
blunt, phosphatase-treated Entry Vector. 

5. Incubate at 25°C for 1 hour, then 65°C for 10 minutes. Add 90 µl TE, transform 
10 µl into 50-100 µl competent E. coli cells. 

6. Plate on kanamycin. 



Note: In the above protocol, steps 1-2 simultaneously polish the ends of the PCR 
product (through the exonuclease and polymerase activities of T4 DNA 
polymerase) and phosphorylate the 5' ends (using T4 polynucleotide kinase). It 
is necessary to inactivate the kinase, so that the blunt, dephosphorylated vector in 
step 4 cannot self-ligate. Step 3 (the PEG precipitation) removes all small 
molecules (primers, nucleotides), and has also been found to improve the yield of 
cloned PCR product by 50-fold. 



Cloning PCR Products after Digestion with Restriction Enzymes 

Efficient cloning of PCR products that have been digested with restriction 
enzymes includes three steps: inactivation of Taq DNA polymerase, efficient 
restriction enzyme cutting, and removal of small DNA fragments. 

Inactivation of Taq DNA Polymerase: Carryover of Taq DNA polymerase 
and dNTPs into a RE digestion significantly reduces the success in cloning a PCR 
product (D. Fox et al., FOCUS 20(1):15, 1998), because Taq DNA polymerase 
can fill in sticky ends and add bases to blunt ends. Either TaqQuench™ 
(obtainable from Life Technologies, Inc.; Rockville, Maryland) or extraction with 
phenol can be used to inactivate the Taq. 

Efficient Restriction Enzyme Cutting: Extra bases on the 5' end of each 
PCR primer help the RE cut near ends of PCR products. With the availability of 
cheap primers, adding 6 to 9 bases on the 5' sides of the restriction sites is a good 
investment to ensure that most of the ends are digested. Incubation of the DNA 
with a 5-fold excess of restriction enzyme for an hour or more helps ensure 
success. 

Removal of Small Molecules before Ligation: Primers, nucleotides, 
primer dimers, and small fragments produced by the restriction enzyme digestion 
can all inhibit or compete with the desired ligation of the PCR product to the 
cloning vector. This protocol uses PEG precipitation to remove small molecules. 

Protocol for cutting the ends of PCR products with restriction enzyme(s) 

1. Inactivation of Taq DNA polymerase in the PCR product: 

Option A: Extraction with Phenol 

A1. Dilute the PCR reaction to 200 µl with TE. Add an equal volume of 
phenol:chloroform:isoamyl alcohol, vortex vigorously for 20 seconds, and 
centrifuge for 1 minute at room temperature. Discard the lower phase. 

A2. Extract the phenol from the DNA and concentrate as follows. Add 
an equal volume of 2-butanol (colored red with "Oil Red O" from Aldrich, 
if desired), vortex briefly, centrifuge briefly at room temperature. Discard 
the upper butanol phase. Repeat the extraction with 2-butanol. This time 
the volume of the lower aqueous phase should decrease significantly. 
Discard the upper 2-butanol phase. 

A3. Ethanol precipitate the DNA from the aqueous phase of the above 
extractions. Dissolve in 200 µl of a suitable restriction enzyme (RE) 
buffer. 

Option B: Inactivation with TaqQuench 

B1. Ethanol precipitate an appropriate amount of PCR product (100 ng 
to 1 µg); dissolve in 200 µl of a suitable RE buffer. 

B2. Add 2 µl TaqQuench. 

2. Add 10 to 50 units of restriction enzyme and incubate for at least 1 hour. 
Ethanol precipitate if necessary to change buffers for digestion at the other end of 
the PCR product. 

3. Add 1/2 volume of the PEG/MgCl2 mix to the RE digestion. Mix well and 
immediately centrifuge at room temperature for 10 minutes. Discard the 
supernatant (pellet is usually invisible), centrifuge again for a few seconds, discard 
any remaining supernatant. 

4. Dissolve the DNA in a suitable volume of TE (depending on the amount of 
PCR product in the original amplification reaction) and apply an aliquot to an 
agarose gel to confirm recovery. Apply to the same gel 20-100 ng of the 
appropriate Entry Vector that will be used for the cloning. 

Example 12: Determining The Expected Size of the GATEWAY™ Cloning 
Reaction Products 

If you have access to a software program that will electronically cut and 
splice sequences, you can create electronic clones to aid you in predicting the sizes 
and restriction patterns of GATEWAY™ Cloning System recombination 
products. 

The cleavage and ligation steps performed by the enzyme Int in the 
GATEWAY™ Cloning System recombination reactions mimic a restriction 
enzyme cleavage that creates a 7-bp 5'-end overhang followed by a ligation step 
that reseals the ends of the daughter molecules. The recombination proteins 
present in the Clonase cocktails (see Example 19 below) recognize the 15 bp core 
sequence present within all four types of att sites (in addition to other flanking 
sequences characteristic of each of the different types of att sites). 

By treating these sites in your software program as if they were restriction 
sites, you can cut and splice your Entry Clones with various Destination Vectors 
and obtain accurate maps and sequences of the expected results from your 
GATEWAY™ Cloning System reactions. 
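
The "electronic cloning" idea above can be sketched as a simple string operation: locate the two shared sites in each molecule and exchange the segments that lie between them. The sketch below is a simplification under stated assumptions. Molecules are treated as linear strings opened outside the region of interest, the function names are hypothetical, and CORE_1 and CORE_2 are arbitrary placeholder 15-mers, not the real att core sequences.

```python
# Placeholder 15-mers standing in for the core sequences shared by the two
# partner sites; these are NOT real att sequences.
CORE_1 = "GGTACCAGATCTGCA"
CORE_2 = "CTGCAGTCTAGAGGC"


def split_at_sites(seq, site1, site2):
    """Return (left flank, segment between the sites, right flank)."""
    i = seq.find(site1)
    if i == -1:
        raise ValueError("first site not found")
    j = seq.find(site2, i + len(site1))
    if j == -1:
        raise ValueError("second site not found downstream of the first")
    return seq[:i], seq[i + len(site1):j], seq[j + len(site2):]


def recombine(entry, destination, site1=CORE_1, site2=CORE_2):
    """Predict the product carrying the Entry segment in the Destination backbone."""
    _, segment, _ = split_at_sites(entry, site1, site2)
    left, _, right = split_at_sites(destination, site1, site2)
    return left + site1 + segment + site2 + right


entry_clone = "aaaa" + CORE_1 + "ATGGCTGCTTAA" + CORE_2 + "tttt"
destination = "cccc" + CORE_1 + "NNNNNNNNNNNNNNNNNN" + CORE_2 + "gggg"
product = recombine(entry_clone, destination)
print(product, len(product))
```

The predicted product length can then be compared with the fragment sizes seen on a gel after a diagnostic digest.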

Example 13: Protein Expression 
Brief Review of Protein Expression 

Transcription: The most commonly used promoters in E. coli Expression 
Vectors are variants of the lac promoter, and these can be turned on by adding 
IPTG to the growth medium. It is usually good to keep promoters off until 
expression is desired, so that the host cells are not made sick by the 
overabundance of some heterologous protein. This is reasonably easy in the case 
of the lac promoters used in E. coli. One needs to supply the lacI gene (or its 
more productive relative, the lacIq gene) to make lac repressor protein, which 
binds near the promoter and keeps transcription levels low. Some Destination 
Vectors for E. coli expression carry their own lacIq gene for this purpose. 
(However, lac promoters are always a little "on," even in the absence of IPTG.) 

Controlling transcription in eukaryotic cells is not nearly so straightforward 
or efficient. The tetracycline system of Bujard and colleagues is the most 
successful approach, and one of the Destination Vectors (pDEST11; Figure 31) 
has been constructed to supply this function. 

Translation: Ribosomes convert the information present in mRNA into 
protein. Ribosomes scan RNA molecules looking for methionine (AUG) codons, 
which begin nearly all nascent proteins. Ribosomes must, however, be able to 
distinguish between AUG codons that code for methionine in the middle of 
proteins from those at the start. Most often ribosomes choose AUGs that are 1) 
first in the RNA (toward the 5' end), and 2) have the proper sequence context. 
In E. coli the favored context (first recognized by Shine and Dalgarno, Eur. J. 
Biochem. 57:221 (1975)) is a run of purines (As and Gs) from five to 12 bases 
upstream of the initiating AUG, especially AGGAGG or some variant. 
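
The E. coli context rule described above can be turned into a rough sequence scan. This is only a sketch under stated assumptions: the function name is hypothetical, the threshold of four consecutive purines is an arbitrary choice for illustration, and the window is taken literally as the bases lying 5 to 12 positions upstream of each ATG.

```python
import re


def shine_dalgarno_starts(seq, min_purines=4):
    """Return indices of ATG codons preceded by a purine run 5-12 bases upstream."""
    seq = seq.upper().replace("U", "T")
    purine_run = re.compile("[AG]{%d,}" % min_purines)
    hits = []
    # A lookahead lets overlapping ATG occurrences be found as well.
    for match in re.finditer("(?=ATG)", seq):
        i = match.start()
        # Bases at positions i-12 .. i-5, i.e. 12 down to 5 bases upstream.
        window = seq[max(0, i - 12):max(0, i - 4)]
        if purine_run.search(window):
            hits.append(i)
    return hits


# Toy sequence: AGGAGG sits upstream of the first ATG but not the second.
toy = "CCCCAGGAGGCTTCTTATGGCCATGTAA"
print(shine_dalgarno_starts(toy))
```

A scan like this flags candidate initiation sites only; it says nothing about how efficiently a given site is actually used.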

In eukaryotes, a survey of translated mRNAs by Kozak (J. Biol. Chem. 
266:19867 (1991)) has revealed a preferred sequence context, gcc Acc ATGG, 
around the initiating methionine, with the A at -3 being most important, and a 
purine at +4 (where the A of the ATG is +1), preferably a G, being next most 
influential. Having an A at -3 is enough to make most ribosomes choose the first 
AUG of an mRNA, in plants, insects, yeast, and mammals. (For a review of 
initiation of protein synthesis in eukaryotic cells, see: Pain, V.M. Eur. J. Biochem. 
236:747-771, 1996.) 
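
The two positions singled out above (-3 and +4, with the A of the ATG counted as +1) are easy to check in software. The sketch below is illustrative only: the function name and the labels "strong", "adequate", and "weak" are hypothetical shorthand, not terms used in this document or by Kozak.

```python
def kozak_context(seq, atg_index):
    """Classify the context of the ATG that starts at atg_index (0-based)."""
    seq = seq.upper().replace("U", "T")
    if seq[atg_index:atg_index + 3] != "ATG":
        raise ValueError("no ATG at the given index")
    if atg_index < 3 or atg_index + 3 >= len(seq):
        raise ValueError("need bases at -3 and +4 around the ATG")
    minus3 = seq[atg_index - 3]  # there is no position 0, so -3 is index - 3
    plus4 = seq[atg_index + 3]   # A of ATG is +1, so +4 follows the G
    score = (minus3 == "A") + (plus4 == "G")
    return ("weak", "adequate", "strong")[score]


print(kozak_context("GCCACCATGG", 6))  # preferred context from the text
print(kozak_context("GCCTCCATGC", 6))  # neither preferred base present
```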

Consequences of Translation Signals for GATEWAY™ Cloning System: 
First, translation signals (Shine-Dalgarno in E. coli, Kozak in eukaryotes) have to 
be close to the initiating ATG. The attB site is 25 base pairs long. Thus if 
translation signals are desired near the natural ATG of the nucleic acid molecule 
of interest, they must be present in the Entry Clone of that nucleic acid molecule 
of interest. Also, when a nucleic acid molecule of interest is moved from an Entry 
Clone to a Destination vector, any translation signals will move along. The result 
is that the presence or absence of Shine-Dalgarno and/or Kozak sequences in the 
Entry Clone must be considered, with the eventual Destination Vectors to be used 
in mind. 

Second, although ribosomes choose the 5' ATG most often, internal ATGs 
are also used to begin protein synthesis. The better the translation context around 
this internal ATG, the more internal translation initiation will be seen. This is 
important in the GATEWAY™ Cloning System, because you can make an Entry 
Clone of your nucleic acid molecule of interest, and arrange to have Shine- 
Dalgarno and/or Kozak sequences near the ATG. When this cassette is 
recombined into a Destination Vector that transcribes your nucleic acid molecule 
of interest, you get native protein. If you want, you can make a fusion protein in 
a different Destination Vector, since the Shine-Dalgarno and/or Kozak sequences 
do not contain any stop signals in the same reading frame. However, the presence 
of these internal translation signals may result in a significant amount of native 
protein being made, contaminating, and lowering the yield of, your fusion protein. 
This is especially likely with short fusion tags, like His6. 

A good compromise can be recommended. If an Entry Vector like 
pENTR7 (Figure 16) or pENTR8 (Figure 17) is chosen, the Kozak bases are 
present for native eukaryotic expression. The context for E. coli translation is 
poor, so the yield of an amino-terminal fusion should be good, and the fusion 
protein can be digested with the TEV protease to make near-native protein 
following purification. 

Recommended Conditions for Synthesis of Proteins in E. coli: When 
making proteins in E. coli it is advisable, at least initially, to incubate your cultures 
at 30°C, instead of at 37 °C. Our experience indicates that proteins are less likely 
to form aggregates at 30 °C. In addition, the yields of proteins from cells grown 
at 30 °C frequently are improved. 
The yields of proteins that are difficult to express may also be improved 
by inducing the cultures in mid-log phase of growth, using cultures begun in the 
morning from overnight growths, as opposed to harvesting directly from an 
overnight culture. In the latter case, the cells are preferably in late log or 
stationary growth, which can favor the formation of insoluble aggregates. 

Example 14: Constructing Destination Vectors from Existing Vectors 

Destination Vectors function because they have two recombination sites, 
attR1 and attR2, flanking a chloramphenicol resistance (CmR) gene and a death 
gene, ccdB. The GATEWAY™ Cloning System recombination reactions 
exchange the entire Cassette (except for a few bases comprising part of the attB 
sites) for the DNA segment of interest from the Entry Vector. Because attR1, 
CmR, ccdB gene, and attR2 are contiguous, they can be moved on a single DNA 
segment. If this Cassette is cloned into a plasmid, the plasmid becomes a 
Destination Vector. Figure 63 shows a schematic of the GATEWAY™ Cloning 
System Cassette; attR cassettes in all three reading frames contained in vectors 
pEZC15101, pEZC15102 and pEZC15103 are shown in Figures 64A, 64B, and 
64C, respectively. 

The protocol for constructing a Destination Vector is presented below. 
Keep in mind the following points: 

• Destination Vectors must be constructed and propagated in one of the DB 
strains of E. coli (e.g., DB3.1, and particularly E. coli LIBRARY 
EFFICIENCY® DB3.1™ Competent Cells) available from Life 
Technologies, Inc. (and described in detail in U.S. Provisional Application 
No. 60/122,392, filed on March 2, 1999, which is incorporated herein by 
reference), because the ccdB death gene will kill any E. coli strain that has 
not been mutated such that it will survive the presence of the ccdB gene. 

• If your Destination Vector will be used to make a fusion protein, a 
GATEWAY™ Cloning System cassette with the correct reading frame 
must be used. The nucleotide sequences of the ends of the cassettes are 
shown in Figure 78. The reading frame of the fusion protein domain must 
be in frame with the core region of the attR1 site (for an amino terminal 
fusion) so that the six As are translated into two lysine codons. For a 
C-terminal fusion protein, translation through the core region of the attR2 
site should be in frame with -TAC-AAA-, to yield -Tyr-Lys-. 

• Note that each reading frame Cassette has a different unique restriction 
site between the chloramphenicol resistance and ccdB genes (MluI for 
reading frame A, BglII for reading frame B, and XbaI for reading frame C; 
see Figure 63). 

• Most standard vectors can be converted to Destination Vectors, by 
inserting the Entry Cassette into the MCS of that vector. 

Protocol for Making a Destination Vector 

1. If the vector will make an amino fusion protein, it is necessary to keep the "aaa 
aaa" triplets in attR1 in phase with the triplets of the fusion protein. Determine 
which Entry cassette to use as follows: 

a.) Write out the nucleotide sequence of the existing vector near the 
restriction site into which the Entry cassette will be cloned. These must 
be written in triplets corresponding to the amino acid sequence of the 
fusion domain. 

b.) Draw a vertical line through the sequence that corresponds to the 
restriction site end, after it has been cut and made blunt, i.e., after filling 
in a protruding 5' end or polishing a protruding 3' end. 

c.) Choose the appropriate reading frame cassette: 

• If the coding sequence of the blunt end ends after a complete 
codon triplet, use the reading frame A cassette. See Figures 78, 
79 and 80. 

• If the coding sequence of the blunt end ends in a single base, use 
the reading frame B cassette. See Figures 78, 79 and 81. 

• If the coding sequence of the blunt end ends in two bases, use the 
reading frame C cassette. See Figures 78, 79, 82A-B, and 83A-C. 
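
The cassette choice in step 1c comes down to the number of bases left over after the last complete codon at the blunt end. A minimal sketch, assuming the input is the in-frame coding sequence of the fusion domain written from the first base of a codon up to the blunt end (the function name and example sequences are hypothetical):

```python
def choose_reading_frame_cassette(coding_to_blunt_end):
    """Pick reading frame cassette A, B, or C from the leftover bases."""
    # Ignore the spaces used when writing the sequence out in triplets.
    bases = coding_to_blunt_end.replace(" ", "")
    leftover = len(bases) % 3  # 0 = ends on a complete codon
    return {0: "A", 1: "B", 2: "C"}[leftover]


print(choose_reading_frame_cassette("ATG GGA TCC"))     # complete codon
print(choose_reading_frame_cassette("ATG GGA TCC G"))   # one extra base
print(choose_reading_frame_cassette("ATG GGA TCC GA"))  # two extra bases
```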

2. Cut one to five micrograms of the existing plasmid at the position where you 
wish your nucleic acid molecule of interest (flanked by att sites) to be after the 
recombination reactions. Note: it is better to remove as many of the MCS 
restriction sites as possible at this step. This makes it more likely that restriction 
enzyme sites within the GATEWAY™ Cloning System Cassette will be unique in 
the new plasmid, which is important for linearizing the Destination Vector 
(Example 14, below). 

3. Remove the 5' phosphates with alkaline phosphatase. While this is not 
mandatory, it increases the probability of success. 

4. Make the end(s) blunt with fill-in or polishing reactions. For example, to 1 µg 
of restriction enzyme-cut, ethanol-precipitated vector DNA, add: 

i. 20 µl 5x T4 DNA Polymerase Buffer (165 mM Tris-acetate (pH 
7.9), 330 mM Na acetate, 50 mM Mg acetate, 500 µg/ml BSA, 
2.5 mM DTT) 

ii. 5 µl 10 mM dNTP mix 

iii. 1 Unit of T4 DNA Polymerase 

iv. Water to a final volume of 100 µl 

v. Incubate for 15 min at 37°C. 

5. Remove dNTPs and small DNA fragments: Ethanol precipitate (add three 
volumes of room temperature ethanol containing 0.1 M sodium acetate, mix well, 
immediately centrifuge at room temperature 5-10 minutes), dissolve wet 
precipitate in 200 µl TE, add 100 µl 30% PEG 8000, 30 mM MgCl2, mix well, 
immediately centrifuge for 10 minutes at room temperature, discard supernatant, 
centrifuge again a few seconds, discard any residual liquid. 

6. Dissolve the DNA to a final concentration of 10 - 50 ng per microliter. Apply 
20 - 100 ng to a gel next to supercoiled plasmid and linear size standards to 
confirm cutting and recovery. The cutting does not have to be 100% complete, 
since you will be selecting for the chloramphenicol marker on the Entry cassette. 

7. In a 10 µl ligation reaction combine 10 - 50 ng vector, 10 - 20 ng of Entry 
Cassette (Figure 79), and 0.5 units T4 DNA ligase in ligase buffer. After one hour 
(or overnight, whichever is most convenient), transform 1 µl into one of the DB 
strains of competent E. coli cells with a gyrA462 mutation (See U.S. Provisional 
Application No. 60/122,392, filed on March 2, 1999, which is incorporated herein 
by reference), preferably DB3.1, and most preferably E. coli LIBRARY 
EFFICIENCY® DB3.1™ Competent Cells. The ccdB gene on the Entry Cassette 
will kill other strains of E. coli that have not been mutated so as to survive the 
presence of the ccdB gene. 

8. After expression in SOC medium, plate 10 µl and 100 µl on 
chloramphenicol-containing (30 µg/ml) plates, incubate at 37°C. 

9. Pick colonies, make miniprep DNA. Treat the miniprep with RNase A and 
store in TE. Cut with the appropriate restriction enzyme to determine the 
orientation of the Cassette. Choose clones with the attR1 site next to the amino 
end of the protein expression function of the plasmid. 

Notes on Using Destination Vectors 

• We have found that about ten-fold more colonies result from a GATEWAY™ 
Cloning System reaction if the Destination Vector is linear or relaxed. If the 
competent cells you use are highly competent (>10^8 per microgram), 
linearizing the Destination Vector is less essential. 

• The site or sites used for the linearization must be within the Entry Cassette. 
Sites that cut once or twice within each cassette are shown in Figures 80-82. 

• Minipreps of Destination Vectors will work fine, so long as they have been 
treated with RNase. Since most DB strains are endA- (See U.S. Provisional 
Application No. 60/122,392, filed on March 2, 1999, which is incorporated 
herein by reference), minipreps can be digested with restriction enzymes 
without a prior phenol extraction. 

• Reading the OD260 of miniprep DNA is inaccurate unless the RNA and 
ribonucleotides have been removed, for example, by a PEG precipitation. 



Example 15: Some Options in Choosing Appropriate Entry Vectors and 
Destination Vectors: An Example 

In some applications, it may be desirable to express a nucleic acid molecule 
of interest in two forms: as an amino-terminal fusion in E. coli, and as a native 
protein in eukaryotic cells. This may be accomplished in any of several ways: 

Option 1: Your choices depend on your nucleic acid molecule of interest 
and the fragment that contains it, as well as the available Entry Vectors. For 
eukaryotic translation, you need consensus bases according to Kozak (J. Biol. 
Chem. 266:19867, 1991) near the initiating methionine (ATG) codon. All of the 
Entry Vectors offer this motif upstream of the XmnI site (blunt cutter). One 
option is to amplify your nucleic acid molecule of interest, with its ATG, by PCR, 
making the amino end blunt and the carboxy end containing the natural stop codon 
followed by one of the "right side" restriction sites (EcoRI, NotI, XhoI, EcoRV, 
or XbaI of the pENTR vectors). 

If you know your nucleic acid molecule of interest does not have, for 
example, an XhoI site, you can make a PCR product that has this structure: 

                        XhoI 
5' ATG nnn nnn nnn TAA ctc gag nnn nnn 3' 
3' tac nnn nnn nnn att gag ctc nnn nnn 5' 

After cutting with XhoI, the fragment is ready to clone: 

5' ATG nnn nnn nnn TAA c 3' 
3' tac nnn nnn nnn att gag ct 5' 

(If you follow this example, don't forget to put a phosphate on the amino oligo.) 
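
The digestion shown above can be reproduced in software as a quick check of the expected ends. The sketch below assumes the standard XhoI cut pattern (C^TCGAG, leaving a four-base 5' overhang); the function name and the example sequence are hypothetical. The bottom strand is returned written 3' to 5', as in the diagram.

```python
# Base-pairing table; N is kept as N so placeholder bases pass through.
COMPLEMENT = str.maketrans("ACGTacgtNn", "TGCAtgcaNn")


def xhoi_left_fragment(top_strand):
    """Return (top 5'->3', bottom 3'->5') of the fragment left of the XhoI cut."""
    site = top_strand.upper().find("CTCGAG")
    if site == -1:
        raise ValueError("no XhoI site (CTCGAG) found")
    # Top strand is cut after the first C of the site.
    top = top_strand[:site + 1]
    # Bottom strand is cut four bases further along, giving a 5' overhang.
    bottom = top_strand[:site + 5].translate(COMPLEMENT)
    return top, bottom


top, bottom = xhoi_left_fragment("ATGAAACCCGGGTAACTCGAGTTTTTT")
print("5' " + top + " 3'")
print("3' " + bottom + " 5'")
```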

Option 2: This PCR product could be cloned into two Entry Vectors to 
give the desired products, between the XmnI and XhoI sites: pENTR1A 
(Figures 10A, 10B) or pENTR7 (Figures 16A, 16B). If you clone into 
pENTR1A, amino fusions will have the minimal number of amino acids between 
the fusion domain and your nucleic acid molecule of interest, but the fusion cannot 
be removed with TEV protease. The converse is true of clones in pENTR7, i.e., 
an amino fusion can be cleaved with TEV protease, at the cost of more amino 
acids between the fusion and your nucleic acid molecule of interest. 

In this example, let us choose to clone our hypothetical nucleic acid 
molecule of interest into pENTR7, between the XmnI and XhoI sites. Once this 
is accomplished, several optional protocols using the Entry Clone pENTR7 may 
be followed: 

Option 3: Since the nucleic acid molecule of interest has been amplified 
with PCR, it may be desirable to sequence it. To do this, transfer the nucleic acid 
molecule of interest from the Entry Vector into a vector that has priming sites for 
the standard sequencing primers. Such a vector is pDEST6 (Figures 26A, 26B). 
This Destination Vector places the nucleic acid molecule of interest in the opposite 
orientation to the lac promoter (which is leaky — see Example 3 above). If the 
gene product is toxic to E. coli, this Destination Vector will minimize its toxicity. 

Option 4: While the sequencing is going on, you might wish to check the 
expression of the nucleic acid molecule of interest in, for example, CHO cells, by 
recombining the nucleic acid molecule of interest into a CMV promoter vector 
(pDEST7, Figure 27; or pDEST12, Figure 32), or into a baculovirus vector 
(pDEST8, Figure 28; or pDEST10, Figure 30) for expression in insect cells. Both 
of these vectors will transcribe the coding sequence of your nucleic acid molecule 
of interest, and translate it from the ATG of the PCR product using the Kozak 
bases upstream of the XmnI site. 



Option 5: If you wish to purify protein, for example to make antibodies, you can 
clone the nucleic acid molecule of interest into a His6 fusion vector, pDEST2 
(Figure 22). Since the nucleic acid molecule of interest is cloned downstream of 
the TEV protease cleavage domain of pENTR7 (Figure 16), the amino acid 
sequence of the protein produced will be: 

              [- attB1 -] TEV protease 
NH2-MSYYHHHHHHGITSLYKKAGFENLYFQ↓GTM---COOH 
              ^^^^^^^^^^^ 

The attB site and the restriction sites used to make the Destination and 
Entry Vectors are translated into the underlined 11 amino acids 
(GITSLYKKAGF). Cleavage with TEV protease (arrow) leaves two amino acids, 
GT, on the amino end of the gene product. 
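
The cleavage described above can be illustrated with a minimal Python sketch. It assumes the canonical TEV protease recognition sequence ENLYFQ/G (cut between Q and G) and takes the N-terminal fusion sequence as read from the text; the function name `tev_cleave` is hypothetical and used only for illustration.

```python
# Sketch: find the TEV protease site in the His6 fusion N-terminus and
# report what remains on the gene product after cleavage.

TEV_SITE = "ENLYFQG"  # canonical recognition sequence; the cut is between Q and G

# N-terminal fusion sequence as read from the text: tag, attB1-derived linker,
# TEV site, then the start of the gene product (the final "M").
FUSION_N_TERMINUS = "MSYYHHHHHHGITSLYKKAGFENLYFQGTM"


def tev_cleave(protein):
    """Split a protein at the first TEV site; return (released tag, product)."""
    idx = protein.find(TEV_SITE)
    if idx < 0:
        raise ValueError("no TEV site found")
    cut = idx + len(TEV_SITE) - 1  # cut falls just before the final G of the site
    return protein[:cut], protein[cut:]


tag, product = tev_cleave(FUSION_N_TERMINUS)
print("released tag:", tag)
print("product N-terminus:", product)
```

The product begins with the two residues G and T ahead of the initiating methionine, in agreement with the statement that cleavage leaves GT on the amino end.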

See Figure 55 for an example of a nucleic acid molecule of interest, the 
chloramphenicol acetyl transferase (CAT) gene, cloned into pENTR7 (Figure 16) 
as a blunt (amino)-XhoI (carboxy) fragment, then cloned by recombination into the 
His6 fusion vector pDEST2 (Figure 22). 

Option 6: If the His6 fusion protein is insoluble, you may go on and try a GST 
fusion. The appropriate Destination vector is pDEST3 (Figure 23). 

Option 7: If you need to make RNA probes and prefer SP6 RNA polymerase, 
you can make the top strand RNA with your nucleic acid molecule of interest 
cloned into pSPORT+ (pDEST5 (Figures 25A, 25B)), and the bottom strand 
RNA with the nucleic acid molecule of interest cloned into pSPORT(-) (pDEST6 
(Figures 26A, 26B)). Opposing promoters for T7 RNA polymerase and SP6 RNA 
polymerase are also present in these clones. 




Option 8: It is often worthwhile to clone your nucleic acid molecule of interest 
into a variety of Destination Vectors in the same experiment. For example, if the 
number of colonies varies widely when the various recombination reactions are 
transformed into E. coli, this may be an indication that the nucleic acid molecule 
of interest is toxic in some contexts. (This problem is more clearly evident when 
a positive control gene is used for each Destination Vector.) Specifically, if many 
more colonies are obtained when the nucleic acid molecule of interest is 
recombined into pDEST6 than in pDEST5, there is a good chance that leakiness 
of the lac promoter is causing some expression of the nucleic acid molecule of 
interest in pSPORT "+" (which is not harmful in pDEST6 because the nucleic acid 
molecule of interest is in the opposite orientation). 
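
The comparison suggested in Option 8 can be sketched in Python as follows. This is only an illustrative heuristic: the colony counts, the five-fold threshold, and the function names are all hypothetical and do not come from the experiments described here. Each gene count is normalized to the positive control gene in the same Destination Vector, and vectors whose normalized yield falls far below the best vector are flagged.

```python
# Sketch: flag Destination Vectors in which the gene of interest may be toxic.
# All numbers below are hypothetical examples, not data from the text.

def relative_yield(gene_colonies, control_colonies):
    """Colonies for the gene of interest relative to the positive control gene."""
    return gene_colonies / control_colonies


def flag_toxicity(counts, fold=5.0):
    """counts maps vector name -> (gene colonies, positive-control colonies).

    A vector is flagged when its normalized yield is at least `fold` times
    lower than the best vector in the same experiment (assumed threshold).
    """
    yields = {v: relative_yield(g, c) for v, (g, c) in counts.items()}
    best = max(yields.values())
    flagged = []
    for vector, y in yields.items():
        if y == 0 or best / y >= fold:
            flagged.append(vector)
    return flagged


example_counts = {
    "pDEST5": (12, 400),   # same orientation as the lac promoter
    "pDEST6": (350, 420),  # opposite orientation
}
print(flag_toxicity(example_counts))
```

Normalizing to a positive control gene follows the remark above that the problem is more clearly evident when such a control is used for each Destination Vector.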



Example 16: Demonstration of a One-tube Transfer of a PCR Product (or 
Expression Clone) to Expression Clone via a Recombinational 
Cloning Reaction 

In the BxP recombination (Entry or Gateward) reaction described herein, 
a DNA segment flanked by attB1 and attB2 sites in a plasmid conferring ampicillin 
resistance was transferred by recombination into an attP plasmid conferring 
kanamycin resistance, which resulted in a product molecule wherein the DNA 
segment was flanked by attL sites (attL1 and attL2). This product plasmid 
comprises an "attL Entry Clone" molecule, because it can react with an "attR 
Destination Vector" molecule via the LxR (Destination) reaction, resulting in the 
transfer of the DNA segment to a new (ampicillin resistant) vector. In the 
previously described examples, it was necessary to transform the BxP reaction 
products into E. coli, select kanamycin resistant colonies, grow those colonies in 
liquid culture, and prepare miniprep DNA, before reacting this DNA with a 
Destination Vector in an LxR reaction. 

The goal of the following experiment was to eliminate the transformation 
and miniprep DNA steps, by adding the BxP Reaction products directly to an LxR 
Reaction. This is especially appropriate when the DNA segment flanked by attB 
sites is a PCR product instead of a plasmid, because the PCR product cannot give 
ampicillin-resistant colonies upon transformation, whereas attB plasmids (in 
general) carry an ampicillin resistance gene. Thus use of a PCR product flanked 
by attB sites in a BxP Reaction allows one to select for the ampicillin resistance 
encoded by the desired attB product of a subsequent LxR Reaction. 

Two reactions were prepared. Reaction A (negative control, no attB PCR 
product; 8 µl) contained 50 ng pEZC7102 (attP Donor plasmid, confers 
kanamycin resistance) and 2 µl BxP Clonase (22 ng/µl Int protein and 8 ng/µl 
IHF protein) in BxP buffer (25 mM Tris-HCl, pH 7.8, 70 mM KCl, 5 mM 
spermidine, 0.5 mM EDTA, 250 µg/ml BSA). Reaction B (24 µl) contained 
150 ng pEZC7102, 6 µl BxP Clonase, and 120 ng of the attB-tet-PCR product 
in the same buffer as reaction A. The attB-tet-PCR product comprised the 
tetracycline resistance gene of plasmid pBR322, amplified with two primers 
containing either attB1 or attB2 sites, and having 4 Gs at their 5' ends, as 
described earlier. 

The two reactions were incubated at 25°C for 30 minutes. Then aliquots 
of these reactions were added to new components that comprised LxR Reactions 
or appropriate controls for the LxR Reaction. Five new reactions were thus 
produced: 

Reaction 1: 5 µl of reaction A was added to a 5 µl LxR Reaction containing 
25 ng NcoI-cut pEZC8402 (the attR Destination Vector plasmid) in LxR buffer 
(37.5 mM Tris-HCl, pH 7.7, 16.5 mM NaCl, 35 mM KCl, 5 mM spermidine, 
375 µg/ml BSA), and 1 µl of GATEWAY™ LR Clonase™ Enzyme Mix (total 
volume of 10 µl). 

Reaction 2: Same as reaction 1, except 5 µl of reaction B (positive) were added 
instead of reaction A (negative). 

Reaction 3: Same as reaction 2, except that the amounts of NcoI-cut pEZC8402 
and GATEWAY™ LR Clonase™ Enzyme Mix were doubled, to 50 ng and 2 µl, 
respectively. 

Reaction 4: Same as reaction 2, except that 25 ng of pEZ11104 (a positive 
control attL Entry Clone plasmid) were added in addition to the aliquot of 
reaction B. 



Reaction 5: Positive control LxR Reaction, containing 25 ng NcoI-cut 
pEZC8402, 25 ng pEZ11104, 37.5 mM Tris-HCl pH 7.7, 16.5 mM NaCl, 35 mM 
KCl, 5 mM spermidine, 375 µg/ml BSA and 1 µl GATEWAY™ LR Clonase™ 
Enzyme Mix in a total volume of 5 µl. 

All five reactions were incubated at 25°C for 30 minutes. Then, 1 µl 
aliquots of each of the above five reactions, plus 1 µl from the remaining volume 
of Reaction B, the standard BxP Reaction, were used to transform 50 µl 
competent DH5α E. coli. DNA and cells were incubated on ice for 15 min., heat 
shocked at 42°C for 45 sec, and 450 µl SOC were added. Each tube was 
incubated with shaking at 37°C for 60 min. Aliquots of 100 µl and 400 µl of 
each transformation were plated on LB plates containing either 50 µg/ml 
kanamycin or 100 µg/ml ampicillin (see Table 2). A transformation with 10 pg of 
pUC19 DNA (plated on LB-amp100) served as a control on the transformation 
efficiency of the DH5α cells. Following incubation overnight at 37°C, the 
number of colonies on each plate was determined. 

Results of these reactions are shown in Table 2. 



Table 2* 

Reaction No.:   1            2            3            4             5          6 
                Neg.         1X           2X           LxR           LxR        BxP 
                Control      pEZC8402     pEZC8402     Reaction      Reaction   Reaction 
                BxP          and LR       and LR       with Pos.     alone      alone 
                Reaction     Clonase™     Clonase™     Control DNA 

Vol. plated:                        Number of Colonies 
100 µl          2            1            8            9             ~1000      ~1000 
400 µl          5            10           35           62            >2000      >2000 
Selection:      Kan          -            Amp          Amp           Amp        Kan 

*(Transformation with pUC19 DNA yielded 1.4 x 10^9 CFU/µg DNA.) 

34 of the 43 colonies obtained from Reaction 3 were picked into 2 ml 
Terrific Broth with 100 µg/ml ampicillin and these cultures were grown overnight, 
with shaking, at 37°C. 27 of the 34 cultures gave at least moderate growth, and 
of these 24 were used to prepare miniprep DNA, using the standard protocol. 
These 24 DNAs were initially analyzed as supercoiled (SC) DNA on a 1% agarose 
gel to identify those with inserts and to estimate the sizes of the inserts. Fifteen 
of the 24 samples displayed SC DNA of the size predicted (5553 bp) if tetx7102 
had correctly recombined with pEZC8402 to yield tetx8402. One of these 
samples contained two plasmids, one of ~5500 bp and one of ~3500 bp. The 
majority of the remaining clones were approximately 4100 bp in size. 

All 15 of the clones displaying SC DNA of predicted size (~5500 bp) were 
analyzed by two different double digests with restriction endonucleases to confirm 
the structure of the expected product, tetx8402. (See plasmid maps, 
Figures 57-59.) In one set of digests, the DNAs were treated with NotI and 
EcoRI, which should cut the predicted product just outside both attB sites, releasing 
the tet insert on a fragment of 1475 bp. In the second set of digests, the DNAs 
were digested with NotI and with NruI. NruI cleaves asymmetrically within the 
subcloned tet insert, and together with NotI will release a fragment of 1019 bp. 

Of the 15 clones analyzed by double restriction digestion, 14 revealed the 
predicted sizes of fragments for the expected product. 
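
The screening of the Reaction 3 colonies can be summarized as a step-by-step yield, sketched below in Python. The counts are taken from the text; the variable and function names are hypothetical, and the sketch is only a bookkeeping aid, not part of the experimental method.

```python
# Sketch: stepwise yields through the screening of Reaction 3 colonies.
# Counts are those reported in the text.

funnel = [
    ("colonies obtained", 43),
    ("colonies picked", 34),
    ("cultures with at least moderate growth", 27),
    ("minipreps prepared", 24),
    ("supercoiled DNA of predicted size", 15),
    ("correct by double restriction digest", 14),
]


def stepwise_yield(steps):
    """Return (step name, count, fraction of the previous step) for each step."""
    out = []
    for (_, prev), (name, n) in zip(steps, steps[1:]):
        out.append((name, n, n / prev))
    return out


for name, n, frac in stepwise_yield(funnel):
    print(f"{name}: {n} ({frac:.0%} of previous step)")

# Fraction of miniprepped clones that carried the expected product.
correct_among_minipreps = 14 / 24
print(f"correct among minipreps: {correct_among_minipreps:.0%}")
```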

Interpretation: 

The DNA components of Reaction B, pEZC7102 and attB-tet-PCR, are 
shown in Figure 56. The desired product of BxP Reaction B is tetx7102, depicted 
in Figure 57. The LxR Reaction recombines the product of the BxP Reaction, 
tetx7102 (Figure 57), with the Destination Vector, pEZC8402, shown in 
Figure 58. The LxR Reaction with tetx7102 plus pEZC8402 is predicted to yield 
the desired product tetx8402, shown in Figure 59. 

Reaction 2, which combined the BxP Reaction and LxR Reaction, gave 
few colonies beyond those of the negative control Reaction. In contrast, Reaction 
3, with twice the amount of pEZC8402 (Figure 58) and LxR Clonase, yielded a 
larger number of colonies. These colonies were analyzed further, by restriction 
digestion, to confirm the presence of expected product. Reaction 4 included a 
known amount of attL Entry Clone plasmid in the combined BxP-plus-LxR 
reaction. But reaction 4 yielded only about 1% of the colonies obtained when the 
same DNA was used in an LxR reaction alone, Reaction 5. This result suggests 
that the LxR reaction may be inhibited by components of the BxP reaction. 
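
The "about 1%" estimate can be checked against the colony counts in Table 2 with a short Python sketch. Because the counts for the LxR Reaction alone are reported only as approximately 1000 and more than 2000, the ratios below are rough, and the 400 µl figure is an upper bound; the names used are hypothetical.

```python
# Sketch: colonies from the combined reaction with control Entry Clone
# (Reaction 4) as a percentage of the LxR Reaction alone, using Table 2.
# "~1000" and ">2000" are treated as 1000 and 2000, so results are approximate.

reaction_4 = {"100 ul": 9, "400 ul": 62}
lxr_alone = {"100 ul": 1000, "400 ul": 2000}


def percent_of_control(test, control):
    """Percentage of control colonies recovered, per plated volume."""
    return {vol: 100.0 * test[vol] / control[vol] for vol in test}


ratios = percent_of_control(reaction_4, lxr_alone)
for vol, pct in ratios.items():
    print(f"{vol}: {pct:.1f}% of the LxR Reaction alone")
```

Both platings give values of a few percent or less, which is the basis for the suggestion that components of the BxP reaction inhibit the LxR reaction.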

Restriction endonuclease analysis of the products of Reaction 3 revealed 
that a sizeable proportion of the colonies (14 of the 34 analyzed) contained the 
desired tet subclone, tetx8402 (Figure 59). 

The above results establish the feasibility of performing first a BxP 
recombination reaction followed by an LxR recombination reaction, in the same 
tube, simply by adding the appropriate buffer mix, recombination proteins, and 
DNAs to a completed BxP reaction. This method should prove useful as a faster 
way to convert attB-containing PCR products into different Expression 
Clones, eliminating the need to first isolate the intermediate attL-PCR insert 
subclones before recombining these with Destination Vectors. This may prove 
especially valuable for automated applications of these reactions. 

This same one-tube approach allows for the rapid transfer of nucleic acid 
molecules contained in attB plasmid clones into new functional vectors as well. 
As in the above examples, attL subclones generated in a BxP Reaction can be 
recombined directly with various Destination Vectors in a LxR reaction. The only 
additional requirement for using attB plasmids, instead of attB-containing PCR 
products, is that the Destination Vector(s) employed must contain a different 
selection marker from the one present on the attB plasmid itself and the attP 
vector. 

Two alternative protocols for a one-tube reaction have also proven useful 
and somewhat more effective than the conditions described above. 

Alternative 1 : 

Reaction buffer contained 50 mM Tris-HCl (pH 7.5), 50 mM NaCl, 
0.25 mM EDTA, 2.5 mM spermidine, and 200 µg/ml BSA. After a 16 (or 3) hour 
incubation of the PCR product (100 ng) + attP Donor plasmid (100 ng) + 
GATEWAY™ BP Clonase™ Enzyme Mix + Destination Vector (100 ng), 2 µl 
of GATEWAY™ LR Clonase™ Enzyme Mix (per 10 µl reaction mix) was added 
and the mixture was incubated an additional 6 (or 2) hours at 25°C. Stop solution 
was then added as above and the mixture was incubated at 37°C as above and 
transformed by electroporation with 1 µl directly into electrocompetent host cells. 
Results of this series of experiments demonstrated that longer incubation times (16 
hours vs. 3 hours for the BP Reaction, 6 hours vs. 2 hours for the LR Reaction) 
resulted in about twice as many colonies being obtained as for the shorter 
incubation times. With two independent genes, 10/10 colonies having the correct 
cloning patterns were obtained. 

Alternative 2: 

A standard BP Reaction under the reaction conditions described above for 
Alternative 1 was performed for 2 hours at 25 °C. Following the BP Reaction, the 
following components were added to the reaction mixture in a total volume of 
7 µl: 

20 mM Tris-HCl, pH 7.5 
100 mM NaCl 
5 ng/ml Xis-His6 
15% glycerol 
~1000 ng of Destination Vector 

The reaction mixture was then incubated for 2 hours at 25°C, and 2.5 µl of stop 
solution (containing 2 µg/ml proteinase K) was added and the mixture was 
incubated at 37°C for an additional 10 minutes. Chemically competent host cells 
were then transformed with 2 µl of the reaction mixture, or electrocompetent host 
cells (e.g., EMax DH10B cells; Life Technologies, Inc.) were electroporated with 
2 µl of the reaction mixture per 25-40 µl of cells. Following transformation, 
mixtures were diluted with SOC, incubated at 37°C, and plated as described 
above on media selecting for the selection markers on the Destination Vector and 
the Entry clone (B x P reaction product). Analogous results to those described 
for Alternative 1 were obtained with these reaction conditions: a higher level of 
colonies containing correctly recombined reaction products was observed. 

Example 17: Demonstration of a One-tube Transfer of a PCR Product (or 
Expression Clone) to Expression Clone via a Recombinational 
Cloning Reaction 

Single-tube transfer of PCR product DNA or Expression Clones into 
Expression Clones by recombinational cloning has also been accomplished using 
a procedure modified from that described in Example 16. This procedure is as 
follows: 

•Perform a standard BP (Gateward) Reaction (see Examples 9 and 10) in 
20 µl volume at 25°C for 1 hour. 

•After the incubation is over, take a 10 µl aliquot from the 20 µl total 
volume and add 1 µl of Proteinase K (2 mg/ml) and incubate at 37°C for 
10 minutes. This first aliquot can be used for transformation and gel assay 
of BP reaction analysis. Plate BP reaction transformation on LB plates 
with Kanamycin (50 µg/ml). 

•Add the following reagents to the remaining 10 µl aliquot of the BP 
reaction: 

1 µl of 0.75 M NaCl 
2 µl of destination vector (150 ng/µl) 
4 µl of LR Clonase™ (after thawing and brief mixing) 

•Mix all reagents well and incubate at 25°C for 3 hours. Stop the reaction 
at the end of incubation with 1.7 µl of Proteinase K (2 mg/ml) and 
incubate at 37°C for 10 minutes. 

•Transform 2 µl of the completed reaction into 100 µl of competent cells. 
Plate 100 µl and 400 µl on LB plates with Ampicillin (100 µg/ml). 

Notes: 

•If your competent cells are less than 10^8 CFU/µg, and you are concerned about 
getting enough colonies, you can improve the yield several fold by incubating the 
BP reaction for 6-20 hours. Electroporation also can yield better colony output 
than chemical transformation. 

•PCR products greater than about 5-6 kb show significantly lower cloning 
efficiency in the BP reaction. In this case, we recommend using longer incubation 
times for both BP and LR steps. 

•If you want to move your insert gene into several destination vectors 
simultaneously, then scale up the initial BP reaction volume so that you have a 
10 µl aliquot for adding each destination vector. 
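
The volumes in this procedure can be tallied with a small Python sketch. It assumes the additions listed in the protocol (1 µl NaCl, 2 µl destination vector, 4 µl LR Clonase™ per 10 µl BP aliquot) and that one further 10 µl aliquot is kept for the BP check; the function names are hypothetical. The observation that the 1.7 µl stop volume equals one tenth of the LR step volume is an inference from the numbers, not a statement in the text.

```python
# Sketch: volume bookkeeping for the one-tube BP -> LR procedure.

BP_ALIQUOT_UL = 10.0  # BP reaction aliquot carried into each LR step
LR_ADDITIONS_UL = {
    "0.75 M NaCl": 1.0,
    "destination vector (150 ng/ul)": 2.0,
    "LR Clonase": 4.0,
}


def lr_step_summary():
    """Totals for one LR step built on a 10 ul BP aliquot."""
    total = BP_ALIQUOT_UL + sum(LR_ADDITIONS_UL.values())
    return {
        "lr_volume_ul": total,
        "vector_ng": 2.0 * 150.0,
        # Contribution of the NaCl addition alone; salt already present in
        # the BP buffer and enzyme mixes is not counted here.
        "added_nacl_mM": 750.0 * 1.0 / total,
        "one_tenth_volume_ul": total / 10.0,
    }


def bp_volume_needed(n_destination_vectors, keep_bp_check=True):
    """Initial BP reaction volume giving one 10 ul aliquot per vector."""
    aliquots = n_destination_vectors + (1 if keep_bp_check else 0)
    return aliquots * BP_ALIQUOT_UL


summary = lr_step_summary()
print(summary)
print("BP volume for 3 destination vectors:", bp_volume_needed(3), "ul")
```

With a single destination vector the sketch returns the standard 20 µl BP reaction used in the protocol above.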



Example 18: Optimization of GATEWAY™ Clonase™ Enzyme Compositions 

The enzyme compositions containing Int and IHF (for BP Reactions) were 
optimized using a standard functional recombinational cloning reaction (a BP 
reaction) between attB-containing plasmids and attP-containing plasmids, 
according to the following protocol: 

Materials and Methods: 
Substrates: 

AttP - supercoiled pDONR201 

AttB - linear ~1 kb [3H]PCR product amplified from pEZC7501 
Proteins: 

IntH6 - His6-carboxy-tagged λ Integrase 
IHF - Integration Host Factor 

Clonase: 

50 ng/µl IntH6 and 20 ng/µl IHF, admixed in 25 mM Tris-HCl (pH 7.5), 
22 mM NaCl, 5 mM EDTA, 1 mg/ml BSA, 5 mM Spermidine, and 50% 
glycerol. 

Reaction Mixture (total volume of 40 µl): 

1000 ng AttP plasmid 

600 ng AttB [3H] PCR product 

8 µl Clonase (400 ng IntH6, 160 ng IHF) in 25 mM Tris-HCl (pH 7.5), 
22 mM NaCl, 5 mM EDTA, 1 mg/ml BSA, 5 mM Spermidine, 5 mM 
DTT. 
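
The protein amounts quoted for the reaction mixture follow directly from the Clonase stock concentrations, as the following Python sketch shows. The names are hypothetical; the only inputs are the stock concentrations, the 8 µl Clonase volume, and the 40 µl reaction volume given above.

```python
# Sketch: protein mass delivered by a given volume of Clonase stock,
# and the resulting concentration in the 40 ul reaction.

CLONASE_NG_PER_UL = {"IntH6": 50.0, "IHF": 20.0}
REACTION_VOLUME_UL = 40.0


def protein_mass_ng(volume_ul):
    """Mass of each protein (ng) in `volume_ul` of Clonase stock."""
    return {name: conc * volume_ul for name, conc in CLONASE_NG_PER_UL.items()}


masses = protein_mass_ng(8.0)
final_ng_per_ul = {name: ng / REACTION_VOLUME_UL for name, ng in masses.items()}
print(masses)
print(final_ng_per_ul)
```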



Reaction mixture was incubated for 1 hour at 25°C, 4 µl of 2 µg/µl 
proteinase K was added and mixture was incubated for an additional 20 minutes 
at 37°C. Mixture was then extracted with an equal volume of Phenol/Chloroform/ 
Isoamyl alcohol. The aqueous layer was then collected, and 0.1 volumes of 3 M 
sodium acetate and 2 volumes of cold 100% ethanol were added. Tubes were 
then spun in a microcentrifuge at maximum RPM for 10 minutes at room 
temperature. Ethanol was decanted, and pellets were rinsed with 70% ethanol and 
re-centrifuged as above. Ethanol was decanted, and pellets were allowed to air 
dry for 5-10 minutes and then dissolved in 20 µl of 33 mM Tris-Acetate (pH 7.8), 
66 mM potassium acetate, 10 mM magnesium acetate, 1 mM DTT, and 1 mM 
ATP. 2 units of exonuclease V (e.g., Plasmid Safe; Epicentre, Inc., Madison, WI) 
was then added, and the mixture was incubated at 37°C for 30 minutes. 

Samples were then TCA-washed by spotting 30 µl of reaction mixture 
onto a Whatman GF/C filter, washing filters once with 10% TCA + 1% NaPPi for 
10 minutes, three times with 5% TCA for 5 minutes each, and twice with ethanol 
for 5 minutes each. Filters were then dried under a heat lamp, placed into a 
scintillation vial, and counted on a β liquid scintillation counter (LSC). 

The principle behind this assay is that, after exonuclease V digestion, only 
double-stranded circular DNA survives in an acid-insoluble form. All DNA 
substrates and products that have free ends are digested to an acid-soluble form 
and are not retained on the filters. Therefore, only the 3H-labeled attB linear DNA 
which ends up in circular form after both inter- and intramolecular integration is 
complete is resistant to digestion and is recovered as acid-insoluble product. 
Optimal enzyme and buffer formulations in the Clonase compositions therefore are 
those that give the highest levels of circularized 3H-labeled attB-containing 
sequences, as determined by highest cpm in the LSC. Although this assay was 
designed for optimization of GATEWAY™ BP Clonase™ Enzyme Mix 
compositions (Int + IHF), the same type of assay may be performed to optimize 
GATEWAY™ LR Clonase™ Enzyme Mix compositions (Int + IHF + Xis), 
except that the reaction mixtures would comprise 1000 ng of AttR (instead of 
AttP) and 600 ng of AttL (instead of AttB), and 40 ng of His6-carboxy-tagged 
Xis (XisH6) in addition to the IntH6 and IHF. 

Example 19: Testing Functionality of Entry and Destination Vectors 

As part of assessment of the functionality of particular vectors of the 
invention, it is important to functionally test the ability of the vectors to 
recombine. This assessment can be carried out by performing a recombinational 
cloning reaction (as schematized in Figures 2, 4, and 5A and 5B, and as described 
herein and in commonly owned U.S. Application Nos. 08/486,139, filed June 7, 
1995, 08/663,002, filed June 7, 1996 (now U.S. Patent No. 5,888,732), 
09/005,476, filed January 12, 1998, and 09/177,387, filed October 23, 1998, the 
disclosures of all of which are incorporated by reference herein in their entireties), 
by transforming E. coli and scoring colony forming units. However, an alternative 
assay may also be performed to allow faster, simpler assessment of the 
functionality of a given Entry or Destination Vector by agarose gel 
electrophoresis. The following is a description of such an in vitro assay. 

Materials and Methods: 

Plasmid templates pEZC1301 (Figure 84) and pEZC1313 (Figure 85), 
each containing a single wild type att site, were used for the generation of PCR 
products containing attL or attR sites, respectively. Plasmid templates were 
linearized with AlwNI, phenol extracted, ethanol precipitated and dissolved in TE 
to a concentration of 1 ng/µl. 

PCR primers (capital letters represent base changes from wild type): 

attL1       gggg agcct gcttttttGtacAaa gttggcatta taaaaaagca ttgc 
attL2       gggg agcct gctttCttGtacAaa gttggcatta taaaaaagca ttgc 
attL right  tgttgccggg aagctagagt aa 
attR1       gggg Acaag ttTgtaCaaaaaagc tgaacgaga aacgtaaaat 
attR2       gggg Acaag ttTgtaCaaGaaagc tgaacgaga aacgtaaaat 
attR right  ca gacggcatga tgaacctgaa 

PCR primers were dissolved in TE to a concentration of 500 pmol/µl. Primer 
mixes were prepared, consisting of attL1 + attLright primers, attL2 + attLright 
primers, attR1 + attRright primers, and attR2 + attRright primers, each mix 
containing 20 pmol/µl of each primer. 

PCR reactions: 
1 µl plasmid template (1 ng) 
1 µl primer pairs (20 pmoles of each) 
3 µl of H2O 
45 µl of Platinum PCR SuperMix® (Life Technologies, Inc.) 

Cycling conditions (performed in MJ thermocycler): 
95°C/2 minutes 
94°C/30 seconds 
25 cycles of 58°C/30 seconds and 72°C/1.5 minutes 
72°C/5 minutes 
5°C/hold 

The resulting attL PCR product was 1.5 kb, and the resulting attR PCR 
product was 1.0 kb. 

PCR reactions were PEG/MgCl2 precipitated by adding 150 µl H2O and 
100 µl of 3x PEG/MgCl2 solution followed by centrifugation. The PCR products 
were dissolved in 50 µl of TE. Quantification of the PCR product was performed 
by gel electrophoresis of 1 µl and was estimated to be 50-100 ng/µl. 

Recombination reactions of PCR products containing attL or attR sites 
with GATEWAY™ plasmids were performed as follows: 
8 µl of H2O 
2 µl of attL or attR PCR product (100-200 ng) 
2 µl of GATEWAY™ plasmid (100 ng) 
4 µl of 5x Destination buffer 
4 µl of GATEWAY™ LR Clonase™ Enzyme Mix 
20 µl total volume (the reactions can be scaled down to a 5 µl total volume 
by adjusting the volumes of the components to about 1/4 of those shown above, 
while keeping the stoichiometries the same). 
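
The scale-down mentioned above can be sketched in Python. The component volumes are those of the 20 µl reaction listed in the text; the function name `scale_reaction` is hypothetical. Every component is multiplied by the same factor, so the stoichiometries are unchanged.

```python
# Sketch: scale the 20 ul recombination reaction to a smaller total volume
# while keeping the proportions of all components the same.

STANDARD_UL = {
    "H2O": 8.0,
    "attL or attR PCR product": 2.0,
    "GATEWAY plasmid": 2.0,
    "5x Destination buffer": 4.0,
    "LR Clonase Enzyme Mix": 4.0,
}


def scale_reaction(volumes, target_total_ul):
    """Return component volumes rescaled so that they sum to target_total_ul."""
    factor = target_total_ul / sum(volumes.values())
    return {name: v * factor for name, v in volumes.items()}


scaled = scale_reaction(STANDARD_UL, 5.0)
for name, vol in scaled.items():
    print(f"{vol:.1f} ul {name}")
```

For a 5 µl reaction the factor is 1/4, giving 2 µl of water, 0.5 µl each of PCR product and plasmid, and 1 µl each of buffer and enzyme mix.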

Clonase reactions were incubated at 25°C for 2 hours. 2 µl of proteinase K 
(2 mg/ml) was added to stop the reaction. 10 µl was then run on a 1% agarose 
gel. Positive control reactions were performed by reacting attL1 PCR product 
(1.0 kb) with attR1 PCR product (1.5 kb) and by similarly reacting attL2 PCR 
product with attR2 PCR product to observe the formation of a larger (2.5 kb) 
recombination product. Negative controls were similarly performed by reacting 
attL1 PCR product with attR2 PCR product and vice versa or reactions of attL 
PCR product with an attL plasmid, etc. 

In alternative assays, to test attB Entry vectors, plasmids containing single 
attP sites were used. Plasmids containing single att sites could also be used as 
recombination substrates in general to test all Entry and Destination vectors (i.e., 
those containing attL, attR , attB and attP sites). This would eliminate the need 
to do PCR reactions. 

Results: 

Destination and Entry plasmids when reacted with appropriate 
att-containing PCR products formed linear recombinant molecules that could be 
easily visualized on an agarose gel when compared to control reactions containing 
no attL or attR PCR product. Thus, the functionality of Destination and Entry 
vectors constructed according to the invention may be determined either by 
carrying out the Destination or Entry recombination reactions as depicted in 
Figures 2, 4, and 5A and 5B, or more rapidly by carrying out the linearization 
assay described in this Example. 

Example 20: PCR Cloning Using Universal Adapter-Primers 

As described herein, the cloning of PCR products using the GATEWAY™ 
PCR Cloning System (Life Technologies, Inc.; Rockville, MD) requires the 
addition of attB sites (attBl and attB2) to the ends of gene-specific primers used 
in the PCR reaction. The protocols described in the preceding Examples suggest 
that the user add 29 bp (25 bp containing the attB site plus four G residues) to the 
gene-specific primer. It would be advantageous to high volume users of the 
GATEWAY™ PCR Cloning System to generate attB-containing PCR product 
using universal attB adapter-primers in combination with shorter gene-specific 
primers containing a specified overlap to the adapters. The following experiments 
demonstrate the utility of this strategy using universal attB adapter-primers and 
gene-specific primers containing overlaps of various lengths from 6 bp to 18 bp. 
The results demonstrate that gene-specific primers with overlaps of 10 bp to 18 
bp can be used successfully in PCR amplifications with universal attB adapter- 
primers to generate full-length PCR products. These PCR products can then be 
successfully cloned with high fidelity in a specified orientation using the 
GATEWAY™ PCR Cloning System. 

Methods and Results: 

To demonstrate that universal attB adapter-primers can be used with gene- 
specific primers containing partial attB sites in PCR reactions to generate full- 
length PCR product, a small 256 bp region of the human hemoglobin cDNA was 
chosen as a target so that intermediate sized products could be distinguished from 
full-length products by agarose gel electrophoresis. 

The following oligonucleotides were used: 

B1-Hgb:         GGGG ACA AGT TTG TAC AAA AAA GCA GGC T-5'-Hgb* 
B2-Hgb:         GGGG ACC ACT TTG TAC AAG AAA GCT GGG T-3'-Hgb** 
18B1-Hgb:                     TG TAC AAA AAA GCA GGC T-5'-Hgb 
18B2-Hgb:                     TG TAC AAG AAA GCT GGG T-3'-Hgb 
15B1-Hgb:                         AC AAA AAA GCA GGC T-5'-Hgb 
15B2-Hgb:                         AC AAG AAA GCT GGG T-3'-Hgb 
12B1-Hgb:                             AA AAA GCA GGC T-5'-Hgb 
12B2-Hgb:                             AG AAA GCT GGG T-3'-Hgb 
11B1-Hgb:                              A AAA GCA GGC T-5'-Hgb 
11B2-Hgb:                              G AAA GCT GGG T-3'-Hgb 
10B1-Hgb:                                AAA GCA GGC T-5'-Hgb 
10B2-Hgb:                                AAA GCT GGG T-3'-Hgb 
9B1-Hgb:                                  AA GCA GGC T-5'-Hgb 
9B2-Hgb:                                  AA GCT GGG T-3'-Hgb 
8B1-Hgb:                                   A GCA GGC T-5'-Hgb 
8B2-Hgb:                                   A GCT GGG T-3'-Hgb 
7B1-Hgb:                                     GCA GGC T-5'-Hgb 
7B2-Hgb:                                     GCT GGG T-3'-Hgb 
6B1-Hgb:                                      CA GGC T-5'-Hgb 
6B2-Hgb:                                      CT GGG T-3'-Hgb 

attB1 adapter:  GGGG ACA AGT TTG TAC AAA AAA GCA GGC T 
attB2 adapter:  GGGG ACC ACT TTG TAC AAG AAA GCT GGG T 

*  -5'-Hgb = GTC ACT AGC CTG TGG AGC AAG A 
** -3'-Hgb = AGG ATG GCA GAG GGA GAC GAC A 
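
The relationship between the universal adapters and the shortened gene-specific primers can be sketched in Python: each overlap primer is the last n bases of the corresponding attB adapter followed by the gene-specific sequence. The adapter and Hgb sequences are those listed in this Example; the function name `overlap_primer` is hypothetical and the sketch is illustrative only.

```python
# Sketch: build gene-specific primers that carry only the 3'-terminal part
# of the attB adapter, as in the 18B1-Hgb ... 6B2-Hgb series.

ATTB1_ADAPTER = "GGGGACAAGTTTGTACAAAAAAGCAGGCT"
ATTB2_ADAPTER = "GGGGACCACTTTGTACAAGAAAGCTGGGT"
HGB_5 = "GTCACTAGCCTGTGGAGCAAGA"  # -5'-Hgb gene-specific sequence
HGB_3 = "AGGATGGCAGAGGGAGACGACA"  # -3'-Hgb gene-specific sequence


def overlap_primer(adapter, gene_specific, overlap):
    """Prefix the gene-specific sequence with the last `overlap` adapter bases."""
    if not 0 < overlap <= len(adapter):
        raise ValueError("overlap must be between 1 and the adapter length")
    return adapter[-overlap:] + gene_specific


for n in (18, 15, 12, 11, 10):
    print(f"{n}B1-Hgb: {overlap_primer(ATTB1_ADAPTER, HGB_5, n)}")
    print(f"{n}B2-Hgb: {overlap_primer(ATTB2_ADAPTER, HGB_3, n)}")

# Bases saved per primer relative to a primer carrying the full 29-base adapter.
saved_with_12_overlap = len(ATTB1_ADAPTER) - 12
print("bases saved with a 12-base overlap:", saved_with_12_overlap)
```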

The aim of these experiments was to develop a simple and efficient 
universal adapter PCR method to generate attB-containing PCR products suitable 
for use in the GATEWAY™ PCR Cloning System. The reaction mixtures and 
thermocycling conditions should be simple and efficient so that the universal 
adapter PCR method could be routinely applicable to any PCR product cloning 
application. 

PCR reaction conditions were initially found that could successfully 
amplify predominately full-length PCR product using gene-specific primers 
containing 18 bp and 15 bp overlap with universal attB primers. These conditions 
are outlined below: 

10 pmoles of gene-specific primers 
10 pmoles of universal attB adapter-primers 
1 ng of plasmid containing the human hemoglobin cDNA 
100 ng of human leukocyte cDNA library DNA. 
5 µl of 10x Platinum Taq HiFi® reaction buffer (Life Technologies, Inc.) 
2 µl of 50 mM MgSO4 
1 µl of 10 mM dNTPs 
0.2 µl of Platinum Taq HiFi® (1.0 unit) 
H2O to 50 µl total reaction volume 



Cycling conditions: 

95°C/5 min 
25 x (94°C/15 sec, 50°C/30 sec, 68°C/1 min) 
68°C/5 min 
5°C/hold 



To assess the efficiency of the method, 2 µl (1/25) of the 50 µl PCR 
reaction was electrophoresed in a 3% Agarose-1000 gel. With overlaps of 12 bp 
or less, smaller intermediate products containing one or no universal attB adapter 
predominated the reactions. Further optimization of PCR reaction conditions was 
obtained by titrating the amounts of gene-specific primers and universal attB 
adapter-primers. The PCR reactions were set up as outlined above except that the 
amounts of primers added were: 



0, 1, 3 or 10 pmoles of gene-specific primers 
0, 10, 30 or 100 pmoles of adapter-primers 

Cycling conditions: 

95°C/3 min 
25 x (94°C/15 sec, 50°C/45 sec, 68°C/1 min) 
68°C/5 min 
5°C/hold 



The use of limiting amounts of gene-specific primers (3 pmoles) and excess 
adapter-primers (30 pmoles) reduced the amounts of smaller intermediate 
products. Using these reaction conditions the overlap necessary to obtain 
predominately full-length PCR product was reduced to 12 bp. The amounts of 
gene-specific and adapter-primers were further optimized in the following PCR 
reactions: 



0, 1, 2 or 3 pmoles of gene-specific primers 
0, 30, 40 or 50 pmoles of adapter-primers 



Cycling conditions: 

95°C/3 min 
25 x (94°C/15 sec, 48°C/1 min, 68°C/1 min) 
68°C/5 min 
5°C/hold 



The use of 2 pmoles of gene-specific primers and 40 pmoles of adapter-primers 
further reduced the amounts of intermediate products and generated 
predominately full-length PCR products with gene-specific primers containing an 
11 bp overlap. The success of the PCR reactions can be assessed in any PCR 
application by performing a no-adapter control. The use of limiting amounts of 
gene-specific primers should give faint or barely visible bands when 1/25 to 1/10 
of the PCR reaction is electrophoresed on a standard agarose gel. Addition of the 
universal attB adapter-primers should generate a robust PCR reaction with a much 
higher overall yield of product. 
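
The primer amounts used in the three rounds of optimization can be converted to final concentrations and adapter-to-gene-specific ratios with a short Python sketch. It assumes the 50 µl reaction volume stated above; the names and the labels given to the three conditions are hypothetical.

```python
# Sketch: final primer concentrations (nM) and adapter:gene-specific ratios
# for the primer amounts used in a 50 ul PCR reaction.

REACTION_VOLUME_UL = 50.0


def final_nM(pmoles, volume_ul=REACTION_VOLUME_UL):
    """pmol per ul equals uM, so pmol * 1000 / ul gives nM."""
    return pmoles * 1000.0 / volume_ul


# (gene-specific pmoles, adapter pmoles)
conditions = {
    "initial": (10, 10),
    "limiting gene-specific primer": (3, 30),
    "further optimized": (2, 40),
}

for label, (gene, adapter) in conditions.items():
    print(
        f"{label}: gene-specific {final_nM(gene):.0f} nM, "
        f"adapter {final_nM(adapter):.0f} nM, ratio {adapter / gene:.0f}:1"
    )
```

The further optimized condition corresponds to a 20-fold molar excess of adapter-primers over gene-specific primers.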

PCR products from reactions using the 18 bp, 15 bp, 12 bp, 11 bp and 
10 bp overlap gene-specific primers were purified using the CONCERT® Rapid 
PCR Purification System (PCR products greater than 500 bp can be PEG 
precipitated). The purified PCR products were subsequently cloned into an attP-
containing plasmid vector using the GATEWAY™ PCR Cloning System (Life 
Technologies, Inc.; Rockville, MD) and transformed into E. coli. Colonies were 
selected and counted on the appropriate antibiotic media and screened by PCR for 
correct inserts and orientation. 

Raw PCR products (unpurified) from the attB adapter PCR of a plasmid 
clone of part of the human beta-globin (Hgb) gene were also used in 
GATEWAY™ PCR Cloning System reactions. PCR products generated with the 
full attB Bl/B2-Hgb, the 12B1/B2, 11B1/B2 and 10B1/B2 attB overlap Hgb 
primers were successfully cloned into the GATEWAY™ pENTR21 attP vector 
(Figure 49). 24 colonies from each (24 x 4 = 96 total) were tested and each was 
verified by PCR to contain correct inserts. The cloning efficiency expressed as 
cfu/ml is shown below: 

Primer Used            cfu/ml 

Hgb full attB           8,700 
Hgb 12 bp overlap      21,000 
Hgb 11 bp overlap      20,500 
Hgb 10 bp overlap      13,500 
GFP control             1,300 



Interestingly, the overlap PCR products cloned with higher efficiency than 
did the full attB PCR product. Presumably, and as verified by visualization on 
agarose gel, the adapter PCR products were slightly cleaner than was the full attB 
PCR product. The differences in colony output may also reflect the proportion 
of PCR product molecules with intact attB sites. 

Using the attB adapter PCR method, PCR primers with 12 bp attB 
overlaps were used to amplify cDNAs of different sizes (ranging from 1 to 4 kb) 
from a leukocyte cDNA library and from first strand cDNA prepared from HeLa 
total RNA. While three of the four cDNAs could be amplified by this 
method, a non-specific amplification product was also observed that under some 
conditions would interfere with the gene-specific amplification. This non-specific 
product was amplified in reactions containing the attB adapter-primers alone 
without any gene-specific overlap primers present. The non-specific amplification 
product was reduced by increasing the stringency of the PCR reaction and 
lowering the attB adapter PCR primer concentration. 

These results indicate that the adapter-primer PCR approach described in 
this Example will work well for cloned genes. These results also demonstrate the 
development of a simple and efficient method to amplify PCR products that are 
compatible with the GATEWAY™ PCR Cloning System that allows the use of 
shorter gene-specific primers that partially overlap universal attB adapter-primers. 
In routine PCR cloning applications, the use of 12 bp overlaps is recommended. 
The methods described in this Example can thus reduce the length of gene-specific 
primers by up to 17 residues or more, resulting in a significant savings in 
oligonucleotide costs for high volume users of the GATEWAY™ PCR Cloning 
System. In addition, using the methods and assays described in this Example, one 
of ordinary skill can, using only routine experimentation, design and use analogous 
primer-adapters based on or containing other recombination sites or fragments 
thereof, such as attL, attR, attP, lox, FRT, etc. 

Example 21: Mutational Analysis of the Bacteriophage Lambda attL and 
attR Sites: Determinants of att Site Specificity in Site-specific 
Recombination 

To investigate the determinants of att site specificity, the bacteriophage 
lambda attL and attR sites were systematically mutagenized. As noted herein, the 
determinants of specificity have previously been localized to the 7 bp overlap 
region (TTTATAC, which is defined by the cut sites for the integrase protein and 
is the region where strand exchange takes place) within the 15 bp core region 
(GCTTTTTTATACTAA) which is identical in all four lambda att sites, attB, attP, 
attL and attR. This core region, however, has not heretofore been systematically 
mutagenized and examined to define precisely which mutations produce unique 
changes in att site specificity. 

Therefore, to examine the effect of att sequence on site specificity, mutant 
attL and attR sites were generated by PCR and tested in an in vitro site-specific 
recombination assay. In this way all possible single base pair changes within the 
7 bp overlap region of the core att site were generated as well as five additional 
changes outside the 7 bp overlap but within the 15 bp core att site. Each attL PCR 
substrate was tested in the in vitro recombination assay with each of the attR PCR 
substrates. 
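
The mutagenesis scheme described above (every single base pair change within the 7 bp overlap) can be enumerated programmatically. The following is a minimal sketch only: the function name is hypothetical, and the naming pattern for the mutants (wild-type base, position in the overlap, new base, e.g. attLT1A) is inferred from the primer names in this Example rather than stated as a rule in the text.

```python
# Wild-type 15 bp core of the lambda att sites and its 7 bp overlap,
# as given in the text of this Example.
CORE = "gcttttttatactaa"
OVERLAP = "tttatac"


def overlap_mutants(core: str = CORE, overlap: str = OVERLAP) -> dict:
    """Return every single-base substitution within the overlap region.

    Keys follow the inferred naming pattern (wild-type base, 1-based
    position within the overlap, new base). The changed base is written
    in upper case, as in the primer list.
    """
    start = core.index(overlap)  # 0-based start of the overlap within the core
    mutants = {}
    for offset, wild_type in enumerate(overlap):
        for base in "acgt":
            if base == wild_type:
                continue  # skip the wild-type base itself
            pos = start + offset
            name = f"attL{wild_type.upper()}{offset + 1}{base.upper()}"
            mutants[name] = core[:pos] + base.upper() + core[pos + 1:]
    return mutants


if __name__ == "__main__":
    for name, seq in overlap_mutants().items():
        print(f"{name}: {seq}")
```

Seven positions with three alternative bases each give 21 overlap mutants, which is the number of single-base overlap primers listed in this Example.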

Methods 

To examine both the efficiency and specificity of recombination of mutant 
attL and attR sites, a simple in vitro site-specific recombination assay was 
developed. Since the core regions of attL and attR lie near the ends of these sites, 
it was possible to incorporate the desired nucleotide base changes within PCR 
primers and generate a series of PCR products containing mutant attL and attR 
sites. PCR products containing attL and attR sites were used as substrates in an 
in vitro reaction with GATEWAY™ LR Clonase™ Enzyme Mix (Life 
Technologies, Inc.; Rockville, MD). Recombination between a 1.5 kb attL PCR 
product and a 1.0 kb attR PCR product resulted in a 2.5 kb recombinant molecule 
that was monitored using agarose gel electrophoresis and ethidium bromide 
staining. 

Plasmid templates pEZC1301 (Figure 84) and pEZC1313 (Figure 85), 
each containing a single wild type attL or attR site, respectively, were used for the 
generation of recombination substrates. The following list shows primers that 
were used in PCR reactions to generate the attL PCR products that were used as 
substrates in L x R Clonase reactions (capital letters represent changes from the 
wild-type sequence, and the underline represents the 7 bp overlap region within 
the 15 bp core att site; a similar set of PCR primers was used to prepare the attR 
PCR products containing matching mutations): 

GATEWAY™ sites (note: attL2 sequence in GATEWAY™ plasmids begins 
"accca" while the attL2 site in this example begins "agcct" to reflect wild-type attL 
outside the core region.): 



attL1:    gggg agcct gcttttttGtacAaa gttggcatta taaaaaagca ttgc
attL2:    gggg agcct gctttCttGtacAaa gttggcatta taaaaaagca ttgc

Wild-type:

attL0:    gggg agcct gcttttttatactaa gttggcatta taaaaaagca ttgc

Single base changes from wild-type:

attLT1A:  gggg agcct gctttAttatactaa gttggcatta taaaaaagca ttgc
attLT1C:  gggg agcct gctttCttatactaa gttggcatta taaaaaagca ttgc
attLT1G:  gggg agcct gctttGttatactaa gttggcatta taaaaaagca ttgc

attLT2A:  gggg agcct gcttttAtatactaa gttggcatta taaaaaagca ttgc
attLT2C:  gggg agcct gcttttCtatactaa gttggcatta taaaaaagca ttgc
attLT2G:  gggg agcct gcttttGtatactaa gttggcatta taaaaaagca ttgc

attLT3A:  gggg agcct gctttttAatactaa gttggcatta taaaaaagca ttgc
attLT3C:  gggg agcct gctttttCatactaa gttggcatta taaaaaagca ttgc
attLT3G:  gggg agcct gctttttGatactaa gttggcatta taaaaaagca ttgc

attLA4C:  gggg agcct gcttttttCtactaa gttggcatta taaaaaagca ttgc
attLA4G:  gggg agcct gcttttttGtactaa gttggcatta taaaaaagca ttgc
attLA4T:  gggg agcct gcttttttTtactaa gttggcatta taaaaaagca ttgc

attLT5A:  gggg agcct gcttttttaAactaa gttggcatta taaaaaagca ttgc
attLT5C:  gggg agcct gcttttttaCactaa gttggcatta taaaaaagca ttgc
attLT5G:  gggg agcct gcttttttaGactaa gttggcatta taaaaaagca ttgc

attLA6C:  gggg agcct gcttttttatCctaa gttggcatta taaaaaagca ttgc
attLA6G:  gggg agcct gcttttttatGctaa gttggcatta taaaaaagca ttgc
attLA6T:  gggg agcct gcttttttatTctaa gttggcatta taaaaaagca ttgc

attLC7A:  gggg agcct gcttttttataAtaa gttggcatta taaaaaagca ttgc
attLC7G:  gggg agcct gcttttttataGtaa gttggcatta taaaaaagca ttgc
attLC7T:  gggg agcct gcttttttataTtaa gttggcatta taaaaaagca ttgc

Single base changes outside of the 7 bp overlap:

attL8:    gggg agcct Acttttttatactaa gttggcatta taaaaaagca ttgc
attL9:    gggg agcct gcCtttttatactaa gttggcatta taaaaaagca ttgc
attL10:   gggg agcct gcttCtttatactaa gttggcatta taaaaaagca ttgc
attL14:   gggg agcct gcttttttatacCaa gttggcatta taaaaaagca ttgc
attL15:   gggg agcct gcttttttatactaG gttggcatta taaaaaagca ttgc

Note: additional vectors wherein the first nine bases are gggg agcca (i.e., 
substituting an adenine for the thymine in the position immediately preceding the 
15-bp core region), which may or may not contain the single base pair 
substitutions (or deletions) outlined above, can also be used in these experiments. 

Recombination reactions of attL- and attR-containing PCR products were 
performed as follows: 

8 µl of H2O 
2 µl of attL PCR product (100 ng) 
2 µl of attR PCR product (100 ng) 
4 µl of 5x buffer 
4 µl of GATEWAY™ LR Clonase™ Enzyme Mix 
20 µl total volume 

Clonase reactions were incubated at 25°C for 2 hours. 

2 µl of 10X Clonase stop solution (proteinase K, 2 mg/ml) were added to stop the 
reaction. 

10 µl were run on a 1% agarose gel. 

Results 

Each attL PCR substrate was tested in the in vitro recombination assay 
with each of the attR PCR substrates. Changes within the first three positions of 
the 7 bp overlap (TTT of TTTATAC) strongly altered the specificity of recombination. 
These mutant att sites each recombined as well as the wild-type, but only with 
their cognate partner mutant; they did not recombine detectably with any other att 
site mutant. In contrast, changes in the last four positions (ATAC of TTTATAC) only 
partially altered specificity; these mutants recombined with their cognate mutant 
as well as wild-type att sites and recombined partially with all other mutant att 
sites except for those having mutations in the first three positions of the 7 bp 
overlap. Changes outside of the 7 bp overlap were found not to affect specificity 
of recombination, but some did influence the efficiency of recombination. 

Based on these results, the following rules for att site specificity were 
determined: 

•Only changes within the 7 bp overlap affect specificity. 

•Changes within the first 3 positions strongly affect specificity. 

•Changes within the last 4 positions weakly affect specificity. 

Mutations that affected the overall efficiency of the recombination reaction 
were also assessed by this method. In these experiments, a slightly increased (less 
than 2-fold) recombination efficiency with attLT1A and attLC7T substrates was 
observed when these substrates were reacted with their cognate attR partners. 
Also observed were mutations that decreased recombination efficiency 
(approximately 2-3 fold), including attLA6G, attL14 and attL15. These mutations 
presumably reflect changes that affect Int protein binding at the core att site. 

The results of these experiments demonstrate that changes within the first 
three positions of the 7 bp overlap (TTT of TTTATAC) strongly altered the specificity of 
recombination (i.e., att sequences with one or more mutations in the first three 
thymidines would only recombine with their cognate partners and would not 
cross-react with any other att site mutation). In contrast, mutations in the last 
four positions (ATAC of TTTATAC) only partially altered specificity (i.e., att sequences 
with one or more mutations in the last four base positions would cross-react 
partially with the wild-type att site and all other mutant att sites, except for those 
having mutations in one or more of the first three positions of the 7 bp overlap). 
Mutations outside of the 7 bp overlap were not found to affect specificity of 
recombination, but some were found to influence (i.e., to cause a decrease in) the 
efficiency of recombination. 
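
The specificity rules summarized above can be expressed as a small decision function. This is a qualitative sketch under stated assumptions, not a predictive model: the function name and the three labels ("full", "partial", "none") are hypothetical, and the effects on efficiency reported for some mutations are deliberately ignored.

```python
WILD_TYPE_OVERLAP = "TTTATAC"  # 7 bp overlap of the wild-type core att site


def predicted_recombination(overlap_a: str, overlap_b: str,
                            wild_type: str = WILD_TYPE_OVERLAP) -> str:
    """Classify a pairing of two 7 bp overlaps as 'full', 'partial' or 'none'.

    Rules taken from the results of this Example:
      * identical overlaps (cognate partners) recombine fully;
      * a site mutated in the first three positions recombines only with
        its cognate partner;
      * sites that differ only in the last four positions cross-react
        partially with each other and with the wild-type site.
    """
    a, b = overlap_a.upper(), overlap_b.upper()
    if len(a) != len(wild_type) or len(b) != len(wild_type):
        raise ValueError("overlap sequences must be 7 bp long")
    if a == b:
        return "full"
    # Is either partner mutated within the first three positions?
    first_three_mutated = a[:3] != wild_type[:3] or b[:3] != wild_type[:3]
    return "none" if first_three_mutated else "partial"


if __name__ == "__main__":
    pairs = [("ATTATAC", "ATTATAC"), ("ATTATAC", "TTTATAC"),
             ("TTTGTAC", "TTTATAC")]
    for a, b in pairs:
        print(a, b, predicted_recombination(a, b))
```

The function treats the first three and last four positions as two classes only; it does not distinguish individual positions or base identities within each class.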

Example 22: Discovery of Att Site Mutations That Increase the Cloning 
Efficiency of GATEWAY™ Cloning Reactions 

In experiments designed to understand the determinants of att site 
specificity, point mutations in the core region of attL were made. Nucleic acid 
molecules containing these mutated attL sequences were then reacted in an LR 
reaction with nucleic acid molecules containing the cognate attR site (i.e., an attR 
site containing a mutation corresponding to that in the attL site), and 
recombinational efficiency was determined as described above. Several mutations 
located in the core region of the att site were noted that either slightly increased 
(less than 2-fold) or decreased (between 2-4-fold) the efficiency of the 
recombination reaction (Table 3). 

Table 3. Effects of attL mutations on Recombination Reactions. 

Site      Sequence                          Effect on Recombination
attL0     agcctgcttttttatactaagttggcatta
attL5     agcctgctttAttatactaagttggcatta    slightly increased
attL6     agcctgcttttttataTtaagttggcatta    slightly increased
attL13    agcctgcttttttatGctaagttggcatta    decreased
attL14    agcctgcttttttatacCaagttggcatta    decreased
attL15    agcctgcttttttatactaGgttggcatta    decreased

consensus CAACTTnnTnnnAnnAAGTTG 

It was also noted that these mutations presumably reflected changes that 
either increased or decreased, respectively, the relative affinity of the integrase 
protein for binding the core att site. A consensus sequence for an integrase 
core-binding site (CAACTTNNT) has been inferred in the literature but not directly 
tested (see, e.g., Ross and Landy, Cell 33:261-272 (1983)). This consensus core 
integrase-binding sequence was established by comparing the sequences of each 
of the four core att sites found in attP and attB as well as the sequences of five 
non-att sites that resemble the core sequence and to which integrase has been 
shown to bind in vitro. These experiments suggest that many more att site 
mutations might be identified which increase the binding of integrase to the core 
att site and thus increase the efficiency of GATEWAY™ cloning reactions. 

Example 23: Effects of Core Region Mutations on Recombination Efficiency 

To directly compare the cloning efficiency of mutations in the att site core 
region, single base changes were made in the attB2 site of an attB1-TET-attB2 
PCR product. Nucleic acid molecules containing these mutated attB2 sequences 
were then reacted in a BP reaction with nucleic acid molecules containing 
non-cognate attP sites (i.e., wild-type attP2), and recombinational efficiency was 
determined as described above. The cloning efficiencies of these mutant 
attB2-containing PCR products, compared to the standard attB1-TET-attB2 PCR 
product, are shown in Table 4. 



Table 4. Efficiency of Recombination With Mutated attB2 Sites. 

Site       Sequence                         Mutation    Cloning Efficiency
attB0      tcaagttagtataaaaaagcaggct
attB1      ggggacaagtttgtacaaaaaagcaggct
attB2      ggggaccactttgtacaagaaagctgggt                100%
attB2.1    ggggaAcactttgtacaagaaagctgggt    C->A        40%
attB2.2    ggggacAactttgtacaagaaagctgggt    C->A        131%
attB2.3    ggggaccCctttgtacaagaaagctgggt    A->C        4%
attB2.4    ggggaccaAtttgtacaagaaagctgggt    C->A        11%
attB2.5    ggggaccacGttgtacaagaaagctgggt    T->G        4%
attB2.6    ggggaccactGtgtacaagaaagctgggt    T->G        6%
attB2.7    ggggaccacttGgtacaagaaagctgggt    T->G        1%
attB2.8    ggggaccactttTtacaagaaagctgggt    G->T        0.5%



As noted above, a single base change in the attB2.2 site increased the 
cloning efficiency of the attB1-TET-attB2.2 PCR product to 131% compared to 
the attB1-TET-attB2 PCR product. Interestingly, this mutation changes the 
integrase core binding site of attB2 to a sequence that matches more closely the 
proposed consensus sequence. 

Additional experiments were performed to directly compare the cloning 
efficiency of an attB1-TET-attB2 PCR product with a PCR product that contained 
attB sites containing the proposed consensus sequence (see Example 22) of an 
integrase core binding site. The following attB sites were used to amplify 
attB-TET PCR products: 



attB1       ggggacaagtttgtacaaaaaagcaggct
attB1.6     ggggacaaCtttgtacaaaaaagTTggct
attB2       ggggaccactttgtacaagaaagctgggt
attB2.10    ggggacAactttgtacaagaaagTtgggt

BP reactions were carried out between 300 ng (100 fmoles) of 
pDONR201 (Figure 49A) and 80 ng (80 fmoles) of attB-TET PCR product in a 
20 µl volume with incubation for 1.5 hrs at 25°C, creating pENTR201-TET Entry 
clones. A comparison of the cloning efficiencies of the above-noted attB sites in 
BP reactions is shown in Table 5. 

Table 5. Cloning efficiency of BP Reactions. 

PCR product        CFU/ml    Fold Increase
B1-tet-B2           7,500
B1.6-tet-B2        12,000    1.6x
B1-tet-B2.10       20,900    2.8x
B1.6-tet-B2.10     30,100    4.0x



These results demonstrate that attB PCR products containing sequences 
that perfectly match the proposed consensus sequence for integrase core binding 
sites can produce Entry clones with four-fold higher efficiency than standard 
Gateway attB1 and attB2 PCR products. 
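
The fold-increase figures reported in Table 5 are the ratio of each colony count to that of the standard attB1/attB2 product. The short sketch below recomputes them from the CFU/ml values in the table; the function and variable names are illustrative only.

```python
def fold_increase(cfu_per_ml: float, control_cfu_per_ml: float) -> float:
    """Return the colony output relative to the control reaction."""
    if control_cfu_per_ml <= 0:
        raise ValueError("control colony count must be positive")
    return cfu_per_ml / control_cfu_per_ml


# CFU/ml values from Table 5 (BP reactions); B1-tet-B2 is the control.
TABLE_5 = {
    "B1-tet-B2": 7500,
    "B1.6-tet-B2": 12000,
    "B1-tet-B2.10": 20900,
    "B1.6-tet-B2.10": 30100,
}

if __name__ == "__main__":
    control = TABLE_5["B1-tet-B2"]
    for product, cfu in TABLE_5.items():
        print(f"{product}: {fold_increase(cfu, control):.1f}x")
```

Rounded to one decimal place, the ratios agree with the tabulated 1.6x, 2.8x and 4.0x.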

The entry clones produced above were then transferred to pDEST20 
(Figure 40A) via LR reactions (300 ng (64 fmoles) pDEST20 mixed with 50 ng 
(77 fmoles) of the respective pENTR201-TET Entry clone in 20 µl volume; 
incubated for 1 hr at 25°C). The efficiencies of cloning for these 
reactions are compared in Table 6. 

Table 6. Cloning Efficiency of LR Reactions. 

pENTR201-TET x pDEST20    cfu/ml    Fold Increase
L1-tet-L2                  5,800
L1.6-tet-L2                8,000    1.4
L1-tet-L2.10              10,000    1.7
L1.6-tet-L2.10             9,300    1.6



These results demonstrate that the mutations introduced into attB1.6 and 
attB2.10 that transfer with the gene into entry clones slightly increase the 
efficiency of LR reactions. Thus, the present invention encompasses not only 
mutations in attB sites that increase recombination efficiency, but also the 
corresponding mutations that result in the attL sites created by the BP reaction. 

To examine the increased cloning efficiency of the attB1.6-TET-attB2.10 
PCR product over a range of PCR product amounts, experiments analogous to 
those described above were performed in which the amount of attB PCR product 
was titrated into the reaction mixture. The results are shown in Table 7. 



Table 7. Titration of attB PCR products. 

Amount of attB PCR
product (ng)    PCR product              CFU/ml    Fold Increase
20              attB1-TET-attB2           3,500
                attB1.6-TET-attB2.10     21,500    6.1
50              attB1-TET-attB2           9,800
                attB1.6-TET-attB2.10     49,000    5.0
100             attB1-TET-attB2          18,800
                attB1.6-TET-attB2.10     53,000    2.8
200             attB1-TET-attB2          19,000
                attB1.6-TET-attB2.10     48,000    2.5



These results demonstrate that as much as a six-fold increase in cloning 
efficiency is achieved with the attB1.6-TET-attB2.10 PCR product as compared 
to the standard attB1-TET-attB2 PCR product at the 20 ng amount. 

Example 24: Determination of attB Sequence Requirements for Optimum 
Recombination Efficiency 

To examine the sequence requirements for attB and to determine which 
attB sites would clone with the highest efficiency from populations of degenerate 
attB sites, a series of experiments was performed. Degenerate PCR primers were 
designed which contained five bases of degeneracy in the B-arm of the attB site. 
These degenerate sequences would thus transfer with the gene into Entry clones 
in BP reactions and subsequently be transferred with the gene into expression 
clones in LR reactions. The populations of degenerate attB and attL sites could 
thus be cycled from attB to attL back and forth for any number of cycles. By 
altering the reaction conditions at each transfer step (for example by decreasing 
the reaction time and/or decreasing the concentration of DNA) the reaction can 
be made increasingly more stringent at each cycle and thus enrich for populations 
of attB and attL sites that react more efficiently. 

The following degenerate PCR primers were used to amplify a 500 bp 
fragment from pUC18 which contained the lacZ alpha fragment (only the attB 
portion of each primer is shown): 

attB1          GGGG ACAAGTTT GTACAAA AAAGC AGGCT
attB1n16-20    GGGG ACAAGTTT GTACAAA nnnnn AGGCT
attB1n21-25    GGGG ACAAGTTT GTACAAA AAAGC nnnnn

attB2          GGGG ACCACTTT GTACAAG AAAGC TGGGT
attB2n16-20    GGGG ACCACTTT GTACAAG nnnnn TGGGT
attB2n21-25    GGGG ACCACTTT GTACAAG AAAGC nnnnn

The starting population size of degenerate att sites is 4^5 or 1024 molecules. Four 
different populations were transferred through two BP reactions and two LR 
reactions. Following transformation of each reaction, the population of 
transformants was amplified by growth in liquid media containing the appropriate 
selection antibiotic. DNA was prepared from the population of clones by alkaline 
lysis miniprep and used in the next reaction. The results of the BP and LR cloning 
reactions are shown below. 
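
The starting population size quoted above follows from five degenerate positions with four possible bases each (4^5 = 1024). The sketch below expands one of the degenerate primer templates to confirm the count; the function name is hypothetical, and the template is the attB1n16-20 sequence with its spaces removed.

```python
from itertools import product


def expand_degenerate(template: str, alphabet: str = "ACGT"):
    """Yield every sequence encoded by a template in which 'n' marks a
    fully degenerate position (any of the four bases)."""
    slots = [alphabet if ch in "nN" else ch for ch in template]
    for combo in product(*slots):
        yield "".join(combo)


# attB1n16-20 primer (attB portion only), as listed in this Example.
ATTB1_N16_20 = "GGGGACAAGTTTGTACAAAnnnnnAGGCT"

if __name__ == "__main__":
    population = list(expand_degenerate(ATTB1_N16_20))
    print(len(population))  # number of distinct sequences in the population
```

The wild-type attB1 sequence is one member of this population, so the standard site competes with the other variants during each enrichment cycle.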



BP-1, overnight reactions 

                              cfu/ml    percent of control
attB1-LacZα-attB2             78,500    100%
attB1n16-20-LacZα-attB2        1,140    1.5%
attB1n21-25-LacZα-attB2       11,100    14%
attB1-LacZα-attB2n16-20          710    0.9%
attB1-LacZα-attB2n21-25       16,600    21%


LR-1, pENTR201-LacZα x pDEST20/EcoRI, 1 hr reactions 

                              cfu/ml    percent of control
attL1-LacZα-attL2             20,000    100%
attL1n16-20-LacZα-attL2        2,125    11%
attL1n21-25-LacZα-attL2        2,920    15%
attL1-LacZα-attL2n16-20        3,190    16%
attL1-LacZα-attL2n21-25        1,405    7%


BP-2, pEXP20-LacZα/ScaI x pDONR201, 1 hr reactions 

                              cfu/ml    percent of control
attB1-LacZα-attB2             48,600    100%
attB1n16-20-LacZα-attB2       22,800    47%
attB1n21-25-LacZα-attB2       31,500    65%
attB1-LacZα-attB2n16-20       42,400    87%
attB1-LacZα-attB2n21-25       34,500    71%


LR-2, pENTR201-LacZα x pDEST6M/cd, 1 hr reactions 

                              cfu/ml    percent of control
attL1-LacZα-attL2             23,000    100%
attL1n16-20-LacZα-attL2       49,000    213%
attL1n21-25-LacZα-attL2       18,000    80%
attL1-LacZα-attL2n16-20       37,000    160%
attL1-LacZα-attL2n21-25       57,000    250%



These results demonstrate that at each successive transfer, the cloning 
efficiency of the entire population of att sites increases, and that there is a great 
deal of flexibility in the definition of an attB site. Specific clones may be isolated 
from the above reactions, tested individually for recombination efficiency, and 
sequenced. Such new specificities may then be compared to known examples to 
guide the design of new sequences with new recombination specificities. In 
addition, based on the enrichment and screening protocols described herein, one 
of ordinary skill can easily identify and use sequences in other recombination sites, 
e.g., other att sites, lox, FRT, etc., that result in increased specificity in the 
recombination reactions using nucleic acid molecules containing such sequences. 



Example 25: Design of att Site PCR Adapter-Primers 

Additional studies were performed to design gene-specific primers with 
12 bp of attB1 and attB2 at their 5'-ends. The optimal primer design for 
att-containing primers is the same as for any PCR primers: the gene-specific portion 
of the primers should ideally have a Tm of > 50°C at 50 mM salt (calculation of 
Tm is based on the formula 59.9 + 41(%GC) - 675/n). 

Primers: 

12 bp attB1: AA AAA GCA GGC TNN - forward gene-specific primer 
12 bp attB2: A GAA AGC TGG GTN - reverse gene-specific primer 
attB1 adapter primer: GGGGACAAGTTTGTACAAAAAAGCAGGCT 
attB2 adapter primer: GGGGACCACTTTGTACAAGAAAGCTGGGT 
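
The primer design described above can be sketched as follows. This is an illustration under stated assumptions rather than a validated design tool: the function names and the `gene_len` parameter are hypothetical, %GC in the Tm formula is taken as a fraction between 0 and 1 (which is what the coefficient of 41 implies), the optional N spacer bases shown in the primer list are omitted, and the target sequence in the usage section is an arbitrary placeholder.

```python
# Adapter-primer sequences; the 12 bp overlaps are their 3' ends.
ATTB1_ADAPTER = "GGGGACAAGTTTGTACAAAAAAGCAGGCT"
ATTB2_ADAPTER = "GGGGACCACTTTGTACAAGAAAGCTGGGT"
OVERLAP_LEN = 12


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def melting_temp(seq: str) -> float:
    """Tm at 50 mM salt: 59.9 + 41 * (GC fraction) - 675 / n."""
    seq = seq.upper()
    gc_fraction = (seq.count("G") + seq.count("C")) / len(seq)
    return 59.9 + 41.0 * gc_fraction - 675.0 / len(seq)


def overlap_primers(target: str, gene_len: int = 20):
    """Return (forward, reverse) primers with 12 bp attB overlaps at the 5' ends.

    The gene-specific parts are the first gene_len bases of the target and
    the reverse complement of its last gene_len bases.
    """
    target = target.upper()
    fwd_gene = target[:gene_len]
    rev_gene = reverse_complement(target[-gene_len:])
    forward = ATTB1_ADAPTER[-OVERLAP_LEN:] + fwd_gene
    reverse = ATTB2_ADAPTER[-OVERLAP_LEN:] + rev_gene
    return forward, reverse


if __name__ == "__main__":
    # Arbitrary placeholder target, for illustration only.
    target = "ATGGCTAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTC"
    fwd, rev = overlap_primers(target)
    for name, primer in (("forward", fwd), ("reverse", rev)):
        gene_part = primer[OVERLAP_LEN:]
        print(f"{name}: {primer}  Tm(gene-specific) = {melting_temp(gene_part):.1f} C")
```

Only the gene-specific portion is passed to `melting_temp`, since the text bases the annealing temperature on that part of the primer; lengthening `gene_len` is the simplest way to raise a Tm that falls below 50°C.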
Protocol: 

(1) Mix 200 ng of cDNA library or 1 ng of plasmid clone DNA 
(alternatively, genomic DNA or RNA could be used) with 10 pmoles of 
gene-specific primers in a 50 µl PCR reaction, using one or more polypeptides having 
DNA polymerase activity such as those described herein. (The addition of greater 
than 10 pmoles of gene-specific primers can decrease the yield of attB PCR 
product. In addition, if RNA is used, a standard reverse transcriptase-PCR 
(RT-PCR) protocol should be followed; see, e.g., Gerard, G.F., et al., FOCUS 1 7:60 
(1989); Myers, T.W., and Gelfand, D.H., Biochem. 30:1661 (1991); Freeman, 
W.N., et al., BioTechniques 20:782 (1996); and U.S. Application No. 09/064,057, 
filed April 22, 1998, the disclosures of all of which are incorporated herein by 
reference.) 

1st PCR profile: 

(a) 95°C for 3 minutes 
(b) 10 cycles of: 
    (i) 94°C for 15 seconds 
    (ii) 50°C* for 30 seconds 
    (iii) 68°C for 1 minute/kb of target amplicon 
(c) 68°C for 5 minutes 
(d) 10°C hold 

*The optimal annealing temperature is determined by the calculated Tm of the 
gene-specific part of the primer. 

(2) Transfer 10 µl to a 40 µl PCR reaction mix containing 35 pmoles each 
of the attB1 and attB2 adapter primers. 

2nd PCR profile: 

(a) 95°C for 1 minute 
(b) 5 cycles of: 
    (i) 94°C for 15 seconds 
    (ii) 45°C* for 30 seconds 
    (iii) 68°C for 1 minute/kb of target amplicon 
(c) 15-20 cycles** of: 
    (i) 94°C for 15 seconds 
    (ii) 55°C* for 30 seconds 
    (iii) 68°C for 1 minute/kb of target amplicon 
(d) 68°C for 5 minutes 
(e) 10°C hold 

*The optimal annealing temperature is determined by the calculated Tm of the 
gene-specific part of the primer. 

**15 cycles is sufficient for low complexity targets. 

Notes: 

1. It is useful to perform a no-adapter primer control to assess the yield of 
attB PCR product produced. 

2. Linearized template usually results in slightly greater yield of PCR 
product. 

Example 26: One-Tube Recombinational Cloning Using the GATEWAY™ 
Cloning System 

To provide for easier and more rapid cloning using the GATEWAY™ 
cloning system, we have designed a protocol whereby the BP and LR reactions 
may be performed in a single tube (a "one-tube" protocol). The following is an 
example of such a one-tube protocol; in this example, an aliquot of the BP 
reaction is taken before adding the LR components, but the BP and LR reactions 
may be performed in a one-tube protocol without first taking the BP aliquot: 



Reaction Component                        Volume
attB DNA (100-200 ng/25 µl reaction)      1-12.5 µl
attP DNA (pDONR201) 150 ng/µl             2.5 µl
5X BP Reaction Buffer                     5.0 µl
Tris-EDTA                                 (to 20 µl)
BP Clonase                                5.0 µl
Total vol.                                25 µl

After the above components were mixed in a single tube, the reaction 
mixtures were incubated for 4 hours at 25°C. A 5 µl aliquot of reaction mixture 
was removed, and 0.5 µl of 10X stop solution was added to this reaction mixture 
and incubated for 10 minutes at 37°C. Competent cells were then transformed 
with 1-2 µl of the BP reaction per 100 µl of cells; this transformation yielded 
colonies of Entry Clones for isolation of individual Entry Clones and for 
quantitation of the BP Reaction efficiency. 

To the remaining 20 µl of BP reaction mixture, the following components 
of the LR reaction were added: 

Reaction Component    Final Concentration    Volume Added
NaCl                  0.75 M                 1 µl
Destination Vector    150 ng/µl              3 µl
LR Clonase                                   5 µl
Total vol.                                   30 µl

After the above components were mixed in a single tube, the reaction 
mixtures were incubated for 2 hours at 25°C. 3 µl of 10X stop solution was 
added, and the mixture was incubated for 10 minutes at 37°C. Competent cells 
were then transformed with 1-2 µl of the reaction mixture per 100 µl of cells. 

Notes: 

1. If desired, the Destination Vector can be added to the initial BP reaction. 
2. The reactions can be scaled down by 2x, if desired. 
3. Shorter incubation times for the BP and/or LR reactions can be used 
   (scaled to the desired cloning efficiencies of the reaction), but a lower 
   number of colonies will typically result. 
4. To increase the number of colonies obtained by several fold, incubate the 
   BP reaction for 6-20 hours and increase the LR reaction to 3 hours. 
   Electroporation also works well with 1-2 µl of the PK-treated reaction 
   mixture. 
5. PCR products greater than about 5 kb may show significantly lower 
   cloning efficiency in the BP reaction. In this case, we recommend using 
   a one-tube reaction with longer incubation times (e.g., 6-18 hours) for 
   both the BP and LR steps. 



Example 27: Relaxation of Destination Vectors During the LR Reaction 

To further optimize the LR Reaction, the composition of the LR Reaction 
buffer was modified from that described above and this modified buffer was used 
in a protocol to examine the impact of enzymatic relaxation of Destination Vectors 
during the LR Reaction. 

LR Reactions were set up as usual (see, e.g., Example 6), except that 5X 
BP Reaction Buffer (see Example 5) was used for the LR Reaction. To 
accomplish Destination Vector relaxation during the LR Reaction, 
Topoisomerase I (Life Technologies, Inc., Rockville, MD; Catalogue No. 
38042-016) was added to the reaction mixture at a final concentration of ~15 U per µg 
of total DNA in the reaction (for example, for reaction mixtures with a total of 
400 ng DNA in the 20 µl LR Reaction, ~6 units of Topoisomerase I was added). 
Reaction mixtures were set up as follows: 

Reaction Component                                  Volume
ddH2O                                               6.5 µl
4X BP Reaction Buffer                               5 µl
100 ng single chain/linear pENTR CAT, 50 ng/µl      2 µl
300 ng single chain/linear pDEST6, 150 ng/µl        2 µl
Topoisomerase I, 15 U/ml                            0.5 µl
LR Clonase                                          4 µl

Reaction mixtures were incubated at 25°C for 1 hour, and 2 µl of 2 µg/µl 
Proteinase K was then added and mixtures incubated for 10 minutes at 37°C to 
stop the LR Reaction. Competent cells were then transformed as described in the 
preceding examples. The results of these studies demonstrated that relaxation of 
substrates in the LR reaction using Topoisomerase I resulted in a 2- to 10-fold 
increase in colony output compared to those LR reactions performed without 
including Topoisomerase I. 

Having now fully described the present invention in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be obvious 
to one of ordinary skill in the art that the same can be performed by modifying or 
changing the invention within a wide and equivalent range of conditions, 
formulations and other parameters without affecting the scope of the invention or 
any specific embodiment thereof, and that such modifications or changes are 
intended to be encompassed within the scope of the appended claims. 

All publications, patents and patent applications mentioned in this 
specification are indicative of the level of skill of those skilled in the art to which 
this invention pertains, and are herein incorporated by reference to the same extent 
as if each individual publication, patent or patent application was specifically and 
individually indicated to be incorporated by reference. 

A. The indications made below relate to the microorganism referred to in the description on page 52, line 

B. IDENTIFICATION OF DEPOSIT 
Further deposits are identified on an additional sheet 

Name of depositary institution: 
Agricultural Research Culture Collection (NRRL) 
International Depository Authority 

Address of depositary institution (including postal code and country): 
1815 N. University Street 
Peoria, Illinois 61604 
United States of America 

Date of deposit: February 27, 1999 

Accession Number: NRRL B-30099 

C. ADDITIONAL INDICATIONS (leave blank if not applicable) 

Escherichia coli DB3.1(pAHPKan) or Escherichia coli DB3.1(pAttPKan) 

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., 
"Accession Number of Deposit") 

For receiving Office use only: This sheet was received with the international application 

Form PCT/RO/134 (July 1998) 



mo-30099 



WO 00/52027

PCT/US00/05432

Applicant's or agent's file reference number 0942.468PC03

International application No. PCT/US00/05432

INDICATIONS RELATING TO DEPOSITED MICROORGANISM
(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description on page , line

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet ☒

Name of depositary institution 

Agricultural Research Culture Collection (NRRL) 
International Depository Authority 



Address of depositary institution (including postal code and country) 

1815 N. University Street 
Peoria, Illinois 61604 
United States of America 



Date of deposit 
February 27, 1999



Accession Number 
NRRL B-30100 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) 



This information is continued on an additional sheet □ 



Escherichia coli DB3.1(pENTR-1A)



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit")



For receiving Office use only 



This sheet was received with the international application

Authorized officer

For International Bureau use only

□ This sheet was received by the International Bureau on:



Authorized officer 



Form PCT/RO/134 (July 1998)



mo-30100 



Applicant's or agent's file reference number 0942.468PC03

International application No. PCT/US00/05432

INDICATIONS RELATING TO DEPOSITED MICROORGANISM
OR OTHER BIOLOGICAL MATERIAL

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description on page , line

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet ☒

Name of depositary institution 

Agricultural Research Culture Collection (NRRL) 
International Depository Authority 



Address of depositary institution (including postal code and country) 

1815 N. University Street 
Peoria, Illinois 61604 
United States of America 



Date of deposit 
February 27, 1999



Accession Number 
NRRL B-30101 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) 



This information is continued on an additional sheet □ 



Escherichia coli DB3.1(pENTR-2B) 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit")



For receiving Office use only 



This sheet was received with the international application 



Authorized officer 



For International Bureau use only 



□ This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1998)



mo-30101 



Applicant's or agent's file reference number 0942.468PC03

International application No. PCT/US00/05432

INDICATIONS RELATING TO DEPOSITED MICROORGANISM
OR OTHER BIOLOGICAL MATERIAL

(PCT Rule 13bis)

REC'D 17 APR 2000

A. The indications made below relate to the microorganism referred to in the description on page 55, line

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet ☒



Name of depositary institution 

Agricultural Research Culture Collection (NRRL) 
International Depository Authority 



Address of depositary institution (including postal code and country) 

1815 N. University Street 
Peoria, Illinois 61604 
United States of America 



Date of deposit 
February 27, 1999 



Accession Number 
NRRL B-30102 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) 



This information is continued on an additional sheet □ 



Escherichia coli DB3.1(pENTR-3C) 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit")



For receiving Office use only 



This sheet was received with the international application

Authorized officer



For International Bureau use only 



□ This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1998)



mo-30102 



Applicant's or agent's file reference number 0942.468PC03

International application No. PCT/US00/05432

A. The indications made below relate to the microorganism referred to in the description on page , line

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet ☒



Name of depositary institution 

Agricultural Research Culture Collection (NRRL) 
International Depository Authority 



Address of depositary institution (including postal code and country) 

1815 N. University Street 
Peoria, Illinois 61604 
United States of America 



Date of deposit 
February 27, 1999 



Accession Number 
NRRL B-30103



C. ADDITIONAL INDICATIONS (leave blank if not applicable) 



This information is continued on an additional sheet □ 



Escherichia coli DB3.1(pEZC15101)



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit")



For receiving Office use only 



This sheet was received with the international application



Authorized officer 



For International Bureau use only 



□ This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1998)



mo-30103 



Applicant's or agent's file reference number 0942.468PC03

A. The indications made below relate to the microorganism referred to in the description on page 54, line

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet ☒

Name of depositary institution 

Agricultural Research Culture Collection (NRRL) 
International Depository Authority 



Address of depositary institution (including postal code and country) 

1815 N. University Street 
Peoria, Illinois 61604 
United States of America 



Date of deposit 
February 27, 1999 



Accession Number 
NRRL B-30104 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) 



This information is continued on an additional sheet □ 



Escherichia coli DB3.1(pEZC15102) 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit")



For receiving Office use only 



This sheet was received with the international application

Authorized officer



For International Bureau use only 



□ This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1998)

mo-30104



Applicant's or agent's file reference number 0942.468PC03

INDICATIONS RELATING TO DEPOSITED MICROORGANISM
OR OTHER BIOLOGICAL MATERIAL

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description on page 54, line

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet ☒



Name of depositary institution 

Agricultural Research Culture Collection (NRRL) 
International Depository Authority 



Address of depositary institution (including postal code and country) 

1815 N. University Street 
Peoria, Illinois 61604 
United States of America 



Date of deposit 
February 27, 1999 



Accession Number 
NRRL B-30105 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) 



This information is continued on an additional sheet □ 



Escherichia coli DB3.1(pEZC15103)



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit")



For receiving Office use only 



This sheet was received with the international application



Authorized officer 



For International Bureau use only

□ This sheet was received by the International Bureau on:



Authorized officer 



Form PCT/RO/134 (July 1998) 



mo-30105 



Applicant's or agent's file reference number 0942.468PC03

International application No. PCT/US00/05432

INDICATIONS RELATING TO DEPOSITED MICROORGANISM
OR OTHER BIOLOGICAL MATERIAL

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description on page 51, line 20-21

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet ☒



Name of depositary institution 

Agricultural Research Culture Collection (NRRL) 

International Depository Authority 



Address of depositary institution (including postal code and country) 

1815 N. University Street 
Peoria, Illinois 61604 
United States of America 



Date of deposit 
February 27, 1999 



Accession Number 
NRRL B-30108 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □ 

Escherichia coli DB10B(pCMVSport6)



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit")



For receiving Office use only 



This sheet was received with the international application 



Authorized officer 



For International Bureau use only 



□ This sheet was received by the International Bureau on:



Authorized officer 



Form PCT/RO/134 (July 1998) 



mo-30108 
WHAT IS CLAIMED IS:

1. An isolated nucleic acid molecule comprising a nucleotide sequence
selected from the group of nucleotide sequences consisting of an attB1 nucleotide
sequence as set forth in Figure 9, an attB2 nucleotide sequence as set forth in
Figure 9, an attP1 nucleotide sequence as set forth in Figure 9, an attP2 nucleotide
sequence as set forth in Figure 9, an attL1 nucleotide sequence as set forth in
Figure 9, an attL2 nucleotide sequence as set forth in Figure 9, an attR1
nucleotide sequence as set forth in Figure 9, an attR2 nucleotide sequence as set
forth in Figure 9, a polynucleotide complementary thereto, and a mutant,
fragment, or derivative thereof.

2. An isolated nucleic acid molecule comprising an attB1 nucleotide
sequence as set forth in Figure 9, a polynucleotide complementary thereto, or a
mutant, fragment, variant or derivative thereof.

3. An isolated nucleic acid molecule comprising an attB2 nucleotide
sequence as set forth in Figure 9, a polynucleotide complementary thereto, or a
mutant, fragment, variant or derivative thereof.

4. An isolated nucleic acid molecule comprising an attP1 nucleotide
sequence as set forth in Figure 9, a polynucleotide complementary thereto, or a
mutant, fragment, variant or derivative thereof.

5. An isolated nucleic acid molecule comprising an attP2 nucleotide
sequence as set forth in Figure 9, a polynucleotide complementary thereto, or a
mutant, fragment, variant or derivative thereof.

6. An isolated nucleic acid molecule comprising an attL1 nucleotide
sequence as set forth in Figure 9, a polynucleotide complementary thereto, or a
mutant, fragment, variant or derivative thereof.

7. An isolated nucleic acid molecule comprising an attL2 nucleotide
sequence as set forth in Figure 9, a polynucleotide complementary thereto, or a
mutant, fragment, variant or derivative thereof.

8. An isolated nucleic acid molecule comprising an attR1 nucleotide
sequence as set forth in Figure 9, a polynucleotide complementary thereto, or a
mutant, fragment, variant or derivative thereof.

9. An isolated nucleic acid molecule comprising an attR2 nucleotide
sequence as set forth in Figure 9, a polynucleotide complementary thereto, or a
mutant, fragment, variant or derivative thereof.

10. The isolated nucleic acid molecule of claim 1, further comprising 
one or more functional or structural nucleotide sequences selected from the group 
consisting of one or more multiple cloning sites, one or more localization signals, 
one or more transcription termination sites, one or more transcriptional regulatory 
sequences, one or more translational signals, one or more origins of replication, 
one or more fusion partner peptide-encoding nucleic acid molecules, one or more 
protease cleavage sites, and one or more 5' polynucleotide extensions. 

11. The nucleic acid molecule of claim 10, wherein said transcriptional
regulatory sequence is a promoter, an enhancer, or a repressor.

12. The nucleic acid molecule of claim 10, wherein said fusion partner
peptide-encoding nucleic acid molecule encodes glutathione S-transferase (GST),
hexahistidine (His6), or thioredoxin (Trx).

13. The nucleic acid molecule of claim 10, wherein said 5' 
polynucleotide extension consists of from one to five nucleotide bases. 



14. The nucleic acid molecule of claim 13, wherein said 5' 
polynucleotide extension consists of four or five guanine nucleotide bases. 
15. A primer nucleic acid molecule suitable for amplifying a target
nucleotide sequence, comprising the isolated nucleic acid molecule of claim 1 or
a portion thereof linked to a target-specific nucleotide sequence useful in
amplifying said target nucleotide sequence.

16. The primer nucleic acid molecule of claim 15, wherein said primer
comprises an attB1 nucleotide sequence having the sequence shown in Figure 9
or a portion thereof, or a polynucleotide complementary to the sequence shown
in Figure 9 or a portion thereof.

17. The primer nucleic acid molecule of claim 15, wherein said primer
comprises an attB2 nucleotide sequence having the sequence shown in Figure 9
or a portion thereof, or a polynucleotide complementary to the sequence shown
in Figure 9 or a portion thereof.



18. The primer nucleic acid molecule of claim 15, further comprising 
a 5' terminal extension of four or five guanine bases. 
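
The primer layout recited in claims 15 to 18 can be sketched in code. The following Python sketch is offered only as an illustration: it concatenates a 5' guanine extension, an attB sequence taken from Figure 9, and a target-specific sequence. The function name `build_attb_primer` and the example target sequence are hypothetical and do not appear in the specification; strand orientation for a given application is not addressed here.

```python
# attB sequences as listed in Figure 9 (5' to 3').
ATTB1 = "ACAAGTTTGTACAAAAAAGCAGGCT"
ATTB2 = "ACCCAGCTTTCTTGTACAAAGTGGT"


def build_attb_primer(att_site: str, target_specific: str, n_guanines: int = 4) -> str:
    """Return 5'-(G)n + att site + target-specific sequence-3' (hypothetical helper)."""
    if n_guanines not in (4, 5):
        raise ValueError("claim 18 recites an extension of four or five guanine bases")
    target = target_specific.upper()
    if not target or set(target) - set("ACGT"):
        raise ValueError("target-specific sequence must be non-empty DNA (A, C, G, T)")
    return "G" * n_guanines + att_site.upper() + target


if __name__ == "__main__":
    # "ATGGCTAGC" is an invented target-specific sequence used only for illustration.
    print(build_attb_primer(ATTB1, "ATGGCTAGC"))
```

Keeping the three segments as separate arguments makes it easy to swap the attB site or to switch between the four-base and five-base extensions.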

19. A vector comprising the isolated nucleic acid molecule of claim 1.

20. The vector of claim 19, wherein said vector is an Expression Vector.

21. A host cell comprising the isolated nucleic acid molecule of claim 1 
or the vector of claim 19. 



22. A method of synthesizing or amplifying one or more nucleic acid 
molecules comprising: 

(a) mixing one or more nucleic acid templates with at least one 
polypeptide having polymerase or reverse transcriptase activity 
and at least a first primer comprising a template-specific sequence 
that is complementary to or capable of hybridizing to said 
templates and at least a second primer comprising all or a portion
of a recombination site wherein said at least a portion of said
second primer is homologous to or complementary to at least a
portion of said first primer; and
(b) incubating said mixture under conditions sufficient to synthesize or 
amplify one or more nucleic acid molecules complementary to all 
or a portion of said templates and comprising one or more 
recombination sites or portions thereof at one or both termini of 
said molecules. 

23. A method of synthesizing or amplifying one or more nucleic acid
molecules comprising: 

(a) mixing one or more nucleic acid templates with at least one 
polypeptide having polymerase or reverse transcriptase activity 
and at least a first primer comprising a template-specific sequence 
that is complementary to or capable of hybridizing to said 
templates and at least a portion of a recombination site, and at 
least a second primer comprising all or a portion of a 
recombination site wherein said at least a portion of said 
recombination site on said second primer is complementary to or 
homologous to at least a portion of said recombination site on said 
first primer; and 

(b) incubating said mixture under conditions sufficient to synthesize or 
amplify one or more nucleic acid molecules complementary to all 
or a portion of said templates and comprising one or more 
recombination sites or portions thereof at one or both termini of 
said molecules. 

24. A method of amplifying or synthesizing one or more nucleic acid 
molecules comprising: 

(a) mixing one or more nucleic acid templates with at least one 
polypeptide having polymerase or reverse transcriptase activity 
and one or more first primers comprising at least a portion of a
recombination site and a template-specific sequence that is 
complementary to or capable of hybridizing to said template; 

(b) incubating said mixture under conditions sufficient to synthesize or 
amplify one or more first nucleic acid molecules complementary to 
all or a portion of said templates wherein said molecules comprise 
at least a portion of a recombination site at one or both termini of 
said molecules; 

(c) mixing said molecules with one or more second primers 
comprising one or more recombination sites, wherein said 
recombination sites of said second primers are homologous to or 
complementary to at least a portion of said recombination sites on 
said first nucleic acid molecules; and 

(d) incubating said mixture under conditions sufficient to synthesize or 
amplify one or more second nucleic acid molecules complementary 
to all or a portion of said first nucleic acid molecules and which 
comprise one or more recombination sites at one or both termini 
of said molecules. 



25. A polypeptide encoded by the isolated nucleic acid molecule of any
one of claims 1-10. 

26. An isolated nucleic acid molecule comprising one or more att 
recombination sites comprising at least one mutation in its core region that 
increases the specificity of interaction between said recombination site and a 
second att recombination site. 



27. The isolated nucleic acid molecule of claim 26, wherein said 
mutation is at least one substitution mutation of at least one nucleotide in the 
seven basepair overlap region of said core region of said recombination site. 
28. The isolated nucleic acid molecule of claim 26, wherein said nucleic
acid molecule comprises the sequence NNNATAC, wherein "N" refers to any
nucleotide with the proviso that if one of the first three nucleotides in the
consensus sequence is a T/U, then at least one of the other two of the first three
nucleotides is not a T/U.
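
The consensus and proviso of claim 28 can be expressed as a short check. The Python sketch below is one reading of the claim language, not a definitive interpretation: it treats the proviso as excluding only the case in which all three leading nucleotides are T/U. The function name `satisfies_overlap_consensus` is hypothetical.

```python
def satisfies_overlap_consensus(overlap: str) -> bool:
    """Check a 7-nucleotide sequence against NNNATAC with the claim 28 proviso.

    Assumed reading of the proviso: if any of the first three nucleotides is
    T/U, at least one of the other two must not be T/U, which excludes only
    the case in which all three are T/U.
    """
    seq = overlap.upper().replace("U", "T")  # treat U as T, as in "T/U"
    if len(seq) != 7 or set(seq) - set("ACGT"):
        return False
    if seq[3:] != "ATAC":  # the fixed part of the consensus
        return False
    return seq[:3] != "TTT"


if __name__ == "__main__":
    for candidate in ("TTTATAC", "TTGATAC", "GTAATAC"):
        print(candidate, satisfies_overlap_consensus(candidate))
```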



29. An isolated nucleic acid molecule comprising one or more mutated 
att recombination sites comprising at least one mutation in its core region that 
enhances the efficiency of recombination between a first nucleic acid molecule 
comprising said mutated att recombination site and a second nucleic acid molecule 
comprising a second recombination site that interacts with said mutated att 
recombination site. 



30. The isolated nucleic acid molecule of claim 29, wherein said 
mutated att recombination site is a mutated attL site comprising a core region 
having the nucleotide sequence caacttnntnnnannaagttg, wherein "n" represents any 
nucleotide. 
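
A degenerate core pattern such as the one recited in claim 30 can be searched for by translating each "n" into a character class. The Python sketch below is illustrative only; the helper names `pattern_to_regex` and `find_core` are hypothetical, and the example sequence in the usage section was constructed to fit the pattern rather than taken from the specification.

```python
import re

# Core-region pattern recited in claim 30; "n" stands for any nucleotide.
CORE_PATTERN = "caacttnntnnnannaagttg"


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a pattern over a/c/g/t/n into a compiled regular expression."""
    parts = []
    for ch in pattern.lower():
        if ch == "n":
            parts.append("[acgt]")
        elif ch in "acgt":
            parts.append(ch)
        else:
            raise ValueError(f"unsupported symbol in pattern: {ch!r}")
    return re.compile("".join(parts))


CORE_REGEX = pattern_to_regex(CORE_PATTERN)


def find_core(sequence: str) -> int:
    """Return the 0-based index of the first match of the core pattern, or -1."""
    match = CORE_REGEX.search(sequence.lower())
    return match.start() if match else -1


if __name__ == "__main__":
    # Constructed example: two flanking bases on each side of a matching core.
    example = "aa" + "caactttgtacaaaaaagttg" + "cc"
    print(find_core(example))
```

This searches one strand only; a fuller check would also scan the reverse complement.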



31. The isolated nucleic acid molecule of claim 30, wherein said 
mutated attL recombination site comprises a core region having a nucleotide 
sequence selected from agcctgctttattatactaagttggcatta (attL5) and
agcctgcttttttatattaagttggcatta (attL6).

32. The isolated nucleic acid molecule of claim 29, wherein said
mutated att recombination site comprises a core region having a nucleotide
sequence selected from the group consisting of ggggacaactttgtacaaaaaagttggct
(attB1.6), ggggacaactttgtacaagaaagctgggt (attB2.2), and
ggggacaactttgtacaagaaagttgggt (attB2.10).



33. A vector selected from the group consisting of pENTR1A,
pENTR2B, pENTR3C, pENTR4, pENTR5, pENTR6, pENTR7, pENTR8,
pENTR9, pENTR10, pENTR11, pDEST1, pDEST2, pDEST3, pDEST4,
pDEST5, pDEST6, pDEST7, pDEST8, pDEST9, pDEST10, pDEST11,
pDEST12.2 (also known as pDEST12), pDEST13, pDEST14, pDEST15,
pDEST16, pDEST17, pDEST18, pDEST19, pDEST20, pDEST21, pDEST22,
pDEST23, pDEST24, pDEST25, pDEST26, pDEST27, pDEST28, pDEST29,
pDEST30, pDEST31, pDEST32, pDEST33, pDEST34, pDONR201 (also known
as pENTR21 attP vector or pAttPkan Donor Vector), pDONR202, pDONR203
(also known as pEZ15812), pDONR204, pDONR205, pDONR206 (also known
as pENTR22 attP vector or pAttPgen Donor Vector), pDONR207, pMAB58,
pMAB62, pMAB85 and pMAB86.

34. A host cell comprising the vector of claim 33.

35. A polypeptide encoded by the vector of claim 33.

36. A kit for use in synthesizing a nucleic acid molecule, said kit 
comprising the isolated nucleic acid molecule of any one of claims 1-10, 26 
and 29. 

37. A kit for use in synthesizing a nucleic acid molecule, said kit 
comprising the primer of claim 15 or claim 18. 

38. A kit for use in cloning a nucleic acid molecule, said kit comprising 
the vector of claim 19 or claim 33. 
Recombination Site Nucleotide Sequences

attB1: 5'-ACAAGTTTGTACAAAAAAGCAGGCT-3'

attB2: 5'-ACCCAGCTTTCTTGTACAAAGTGGT-3'

attP1: 5'-TACAGGTCACTAATACCATCTAAGTAGTTGATTCATAGTGACTGGATATG-
TTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTAATTTA-
ATATATTGATATTTATATCATTTTACGTTTCTCGTTCAGCTTTTTTGTAC-
AAAGTTGGCATTATAAAAAAGCATTGCTCATCAATTTGTTGCAACGAACA-
GGTCACTATCAGTCAAAATAAAATCATTATTTG-3'

attP2: 5'-CAAATAATGATTTTATTTTGACTGATAGTGACCTGTTCGTTGCAACAAAT-
TGATAAGCAATGCTTTCTTATAATGCCAACTTTGTACAAGAAAGCTGAAC-
GAGAAACGTAAAATGATATAAATATCAATATATTAAATTAGATTTTGCAT-
AAAAAACAGACTACATAATACTGTAAAACACAACATATCCAGTCACTATGA-
ATCAACTACTTAGATGGTATTAGTGACCTGTA-3'

attR1: 5'-ACAAGTTTGTACAAAAAAGCTGAACGAGAAACGTAAAATGATATAAA-
TATCAATATATTAAATTAGATTTTGCATAAAAAACAGACTACATAATAC-
TGTAAAACACAACATATCCAGTCACTATG-3'

attR2: 5'-GCAGGTCGACCATAGTGACTGGATATGTTGTGTTTTACAGTATTAT-
GTAGTCTGTTTTTTATGCAAAATCTAATTTAATATATTGATATTT-
ATATCATTTTACGTTTCTCGTTCAGCTTTCTTGTACAAAGTGGT-3'

attL1: 5'-CAAATAATGATTTTATTTTGACTGATAGTGACCTGTTCGTTGCAAC-
AAATTGATAAGCAATGCTTTTTTATAATGCCAACTTTGTACAAAAAA-
GCAGGCT-3'

attL2: 5'-CAAATAATGATTTTATTTTGACTGATAGTGACCTGTTCGTTGCAACAA-
ATTGATAAGCAATGCTTTCTTATAATGCCAACTTTGTACAAGAAAGCTGGGT-3'



Figure 9 
Figure 10A: Cloning sites of the Entry Vector pENTR1A (reading frame A)

ACT TTG TAC AAA AAA GCA GGC TTT AAA GGA ACC AAT TCA GTC GAC TGG ATC CGG TAC CGA ATT C
TGA AAC ATG TTT TTT CGT CCG AAA TTT CCT TGG TTA AGT CAG CTG ACC TAG GCC ATG GCT TAA G
thr leu tyr lys lys ala gly phe lys gly thr asn ser val asp trp ile arg tyr arg ile

EcoR I  Not I  Xho I  EcoR V

ccdB gene -- G AAT TCG CGG CCG CAC TCG AGA TAT CTA GAC CCA GCT TTC TTG TAC AAA
             C TTA AGC GCC GGC GTG AGC TCT ATA GAT CTG GGT CGA AAG AAC ATG TTT


Restriction map of pENTR1A (enzyme, base position; two regions are labeled "multiple cloning site"):

Dra I 166
Kpn I 197
BssH II 243
Bst1107 I 282
Srf I 401
Sma I 402
Mam I 413
BstX I 514
Bsa I 536
Not I 632
EcoR V 646
Pvu II 760
Pst I 763
Rca I 826
Nru I 957
EcoN I 1210
Ssp I 1225
Sgf I 1298
Pvu I 1299
Esp3 I 1312
Vsp I 1500
PflM I 1559
Eco57 I 1893
AlwN I 2006
Hae II 2176
BssS I 2247
Drd I 2312
Nsp I 2420
BspLU11 I 2420
Afl III 2420
Nhe I 2470
Aat II 2488
Bbs I 2553
BsrB I 2623
Psp1406 I 2634



pENTR1A 2717 bp

Base Nos.     Gene Encoded

67..166       attL1
321..626      ccdB
655..754      attL2
877..1686     KmR
1791..2364    ori
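
The base-number ranges in the feature table are 1-based and inclusive, so feature lengths and subsequences follow directly from them. The Python sketch below illustrates this under that assumption; the names `PENTR1A_FEATURES`, `feature_length`, and `extract_feature` are hypothetical helpers, not part of the specification.

```python
# Feature table for pENTR1A as printed above: (name, first base, last base),
# with base numbers assumed to be 1-based and inclusive.
PENTR1A_FEATURES = [
    ("attL1", 67, 166),
    ("ccdB", 321, 626),
    ("attL2", 655, 754),
    ("KmR", 877, 1686),
    ("ori", 1791, 2364),
]
PENTR1A_LENGTH = 2717  # total length in bp, as stated in the listing


def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def extract_feature(sequence: str, start: int, end: int) -> str:
    """Slice a feature out of a sequence using 1-based inclusive coordinates."""
    return sequence[start - 1:end]


if __name__ == "__main__":
    for name, start, end in PENTR1A_FEATURES:
        print(f"{name}: {feature_length(start, end)} bp")
```

Under this reading, the ccdB and KmR ranges are both whole multiples of three bases, which is consistent with their being coding regions.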



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTTAA AGGAACCAAT
181 TCAGTCGACT GGATCCGGTA CCGAATTCGC TTACTAAAAG CCAGATAACA GTATGCGTAT
241 TTGCGCGCTG ATTTTTGCGG TATAAGAATA TATACTGATA TGTATACCCG AAGTATGTCA
301 AAAAGAGGTG TGCTTCTAGA ATGCAGTTTA AGGTTTACAC CTATAAAAGA GAGAGCCGTT
361 ATCGTCTGTT TGTGGATGTA CAGAGTGATA TTATTGACAC GCCCGGGCGA CGGATAGTGA
421 TCCCCCTGGC CAGTGCACGT CTGCTGTCAG ATAAAGTCTC CCGTGAACTT TACCCGGTGG
481 TGCATATCGG GGATGAAAGC TGGCGCATGA TGACCACCGA TATGGCCAGT GTGCCGGTCT
541 CCGTTATCGG GGAAGAAGTG GCTGATCTCA GCCACCGCGA AAATGACATC AAAAACGCCA
601 TTAACCTGAT GTTCTGGGGA ATATAGAATT CGCGGCCGCA CTCGAGATAT CTAGACCCAG
661 CTTTCTTGTA CAAAGTTGGC ATTATAAGAA AGCATTGCTT ATCAATTTGT TGCAACGAAC
721 AGGTCACTAT CAGTCAAAAT AAAATCATTA TTTGCCATCC AGCTGCAGCT CTGGCCCGTG
781 TCTCAAAATC TCTGATGTTA CATTGCACAA GATAAAAATA TATCATCATG AACAATAAAA
841 CTGTCTGCTT ACATAAACAG TAATACAAGG GGTGTTATGA GCCATATTCA ACGGGAAACG
901 TCGAGGCCGC GATTAAATTC CAACATGGAT GCTGATTTAT ATGGGTATAA ATGGGCTCGC
961 GATAATGTCG GGCAATCAGG TGCGACAATC TATCGCTTGT ATGGGAAGCC CGATGCGCCA
1021 GAGTTGTTTC TGAAACATGG CAAAGGTAGC GTTGCCAATG ATGTTACAGA TGAGATGGTC
1081 AGACTAAACT GGCTGACGGA ATTTATGCCT CTTCCGACCA TCAAGCATTT TATCCGTACT
1141 CCTGATGATG CATGGTTACT CACCACTGCG ATCCCCGGAA AAACAGCATT CCAGGTATTA
1201 GAAGAATATC CTGATTCAGG TGAAAATATT GTTGATGCGC TGGCAGTGTC CCTGCGCCGG
1261 TTGCATTCGA TTCCTGTTTG TAATTGTCCT TTTAACAGCG ATCGCGTATT TCGTCTCGCT
1321 CAGGCGCAAT CACGAATGAA TAACGGTTTG GTTGATGCGA GTGATTTTGA TGACGAGCGT
1381 AATGGCTGGC CTGTTGAACA AGTCTGGAAA GAAATGCATA AACTTTTGCC ATTCTCACCG
1441 GATTCAGTCG TCACTCATGG TGATTTCTCA CTTGATAACC TTATTTTTGA CGAGGGGAAA
1501 TTAATAGGTT GTATTGATGT TGGACGAGTC GGAATCGCAG ACCGATACCA GGATCTTGCC
1561 ATCCTATGGA ACTGCCTCGG TGAGTTTTCT CCTTCATTAC AGAAACGGCT TTTTCAAAAA
1621 TATGGTATTG ATAATCCTGA TATGAATAAA TTGCAGTTTC ATTTGATGCT CGATGAGTTT
1681 TTCTAATCAG AATTGGTTAA TTGGTTGTAA CATTATTCAG ATTGGGCCCC GTTCCACTGA
1741 GCGTCAGACC CCGTAGAAAA GATCAAAGGA TCTTCTTGAG ATCCTTTTTT TCTGCGCGTA
1801 ATCTGCTGCT TGCAAACAAA AAAACCACCG CTACCAGCGG TGGTTTGTTT GCCGGATCAA
1861 GAGCTACCAA CTCTTTTTCC GAAGGTAACT GGCTTCAGCA GAGCGCAGAT ACCAAATACT
1921 GTTCTTCTAG TGTAGCCGTA GTTAGGCCAC CACTTCAAGA ACTCTGTAGC ACCGCCTACA
1981 TACCTCGCTC TGCTAATCCT GTTACCAGTG GCTGCTGCCA GTGGCGATAA GTCGTGTCTT
2041 ACCGGGTTGG ACTCAAGACG ATAGTTACCG GATAAGGCGC AGCGGTCGGG CTGAACGGGG
2101 GGTTCGTGCA CACAGCCCAG CTTGGAGCGA ACGACCTACA CCGAACTGAG ATACCTACAG
2161 CGTGAGCTAT GAGAAAGCGC CACGCTTCCC GAAGGGAGAA AGGCGGACAG GTATCCGGTA
2221 AGCGGCAGGG TCGGAACAGG AGAGCGCACG AGGGAGCTTC CAGGGGGAAA CGCCTGGTAT
2281 CTTTATAGTC CTGTCGGGTT TCGCCACCTC TGACTTGAGC GTCGATTTTT GTGATGCTCG
2341 TCAGGGGGGC GGAGCCTATG GAAAAACGCC AGCAACGCGG CCTTTTTACG GTTCCTGGCC
2401 TTTTGCTGGC CTTTTGCTCA CATGTTCTTT CCTGCGTTAT CCCCTGATTC TGTGGATAAC
2461 CGTATTACCG CTAGCATGGA TCTCGGGGAC GTCTAACTAC TAAGCGAGAG TAGGGAACTG
2521 CCAGGCATCA AATAAAACGA AAGGCTCAGT CGGAAGACTG GGCCTTTCGT TTTATCTGTT
2581 GTTTGTCGGT GAACGCTCTC CTGAGTAGGA CAAATCCGCC GGGAGCGGAT TTGAACGTTG
2641 TGAAGCAACG GCCCGGAGGG TGGCGGGCAG GACGCCCGCC ATAAACTGCC AGGCATCAAA
2701 CTAAGCAGAA GGCCATC



Figure: Cloning Sites of the Entry Vector pENTR2B (reading frame B)

Int  attL1  EheI  XmnI  SalI  BamHI  KpnI  EcoRI

TTG TAC AAA AAA GCA GGC TGG CGC CGG AAC CAA TTC AGT CGA CTG GAT CCG GTA CCG AAT TC- ccdB
AAC ATG TTT TTT CGT CCG ACC GCG GCC TTG GTT AAG TCA GCT GAC CTA GGC CAT GGC TTA AG
Leu Tyr Lys Lys Ala Gly Trp Arg Arg Asn Gln Phe Ser Arg Leu Asp Pro Val Pro Asn

EcoRI  NotI  XhoI  EcoRV  XbaI  attL2  Int

-G AAT TCG CGG CCG CAC TCG AGA TAT CTA GAC CCA GCT TTC TTG TAC AAA G
 C TTA AGC GCC GGC GTG AGC TCT ATA GAT CTG GGT CGA AAG AAC ATG TTT C
   Asn Ser Arg Pro His Ser Arg Tyr Leu Asp Pro Ala Phe Leu Tyr Lys



pENTR2B 2718 bp

Location (Base Nos.)    Gene Encoded

67..166       attL1
322..627      ccdB
656..755      attL2
878..1687     KmR
1792..2365    ori

1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTGGCG CCGGAACCAA
181 TTCAGTCGAC TGGATCCGGT ACCGAATTCG CTTACTAAAA GCCAGATAAC AGTATGCGTA
241 TTTGCGCGCT GATTTTTGCG GTATAAGAAT ATATACTGAT ATGTATACCC GAAGTATGTC
301 AAAAAGAGGT GTGCTTCTAG AATGCAGTTT AAGGTTTACA CCTATAAAAG AGAGAGCCGT
361 TATCGTCTGT TTGTGGATGT ACAGAGTGAT ATTATTGACA CGCCCGGGCG ACGGATGGTG
421 ATCCCCCTGG CCAGTGCACG TCTGCTGTCA GATAAAGTCT CCCGTGAACT TTACCCGGTG
481 GTGCATATCG GGGATGAAAG CTGGCGCATG ATGACCACCG ATATGGCCAG TGTGCCGGTC
541 TCCGTTATCG GGGAAGAAGT GGCTGATCTC AGCCACCGCG AAAATGACAT CAAAAACGCC
601 ATTAACCTGA TGTTCTGGGG AATATAGAAT TCGCGGCCGC ACTCGAGATA TCTAGACCCA
661 GCTTTCTTGT ACAAAGTTGG CATTATAAGA AAGCATTGCT TATCAATTTG TTGCAACGAA
721 CAGGTCACTA TCAGTCAAAA TAAAATCATT ATTTGCCATC CAGCTGCAGC TCTGGCCCGT
781 GTCTCAAAAT CTCTGATGTT ACATTGCACA AGATAAAAAT ATATCATCAT GAACAATAAA
841 ACTGTCTGCT TACATAAACA GTAATACAAG GGGTGTTATG AGCCATATTC AACGGGAAAC
901 GTCGAGGCCG CGATTAAATT CCAACATGGA TGCTGATTTA TATGGGTATA AATGGGCTCG
961 CGATAATGTC GGGCAATCAG GTGCGACAAT CTATCGCTTG TATGGGAAGC CCGATGCGCC
1021 AGAGTTGTTT CTGAAACATG GCAAAGGTAG CGTTGCCAAT GATGTTACAG ATGAGATGGT
1081 CAGACTAAAC TGGCTGACGG AATTTATGCC TCTTCCGACC ATCAAGCATT TTATCCGTAC
1141 TCCTGATGAT GCATGGTTAC TCACCACTGC GATCCCCGGA AAAACAGCAT TCCAGGTATT
1201 AGAAGAATAT CCTGATTCAG GTGAAAATAT TGTTGATGCG CTGGCAGTGT TCCTGCGCCG
1261 GTTGCATTCG ATTCCTGTTT GTAATTGTCC TTTTAACAGC GATCGCGTAT TTCGTCTCGC
1321 TCAGGCGCAA TCACGAATGA ATAACGGTTT GGTTGATGCG AGTGATTTTG ATGACGAGCG
1381 TAATGGCTGG CCTGTTGAAC AAGTCTGGAA AGAAATGCAT AAACTTTTGC CATTCTCACC
1441 GGATTCAGTC GTCACTCATG GTGATTTCTC ACTTGATAAC CTTATTTTTG ACGAGGGGAA
1501 ATTAATAGGT TGTATTGATG TTGGACGAGT CGGAATCGCA GACCGATACC AGGATCTTGC
1561 CATCCTATGG AACTGCCTCG GTGAGTTTTC TCCTTCATTA CAGAAACGGC TTTTTCAAAA
1621 ATATGGTATT GATAATCCTG ATATGAATAA ATTGCAGTTT CATTTGATGC TCGATGAGTT
1681 TTTCTAATCA GAATTGGTTA ATTGGTTGTA ACATTATTCA GATTGGGCCC CGTTCCACTG
1741 AGCGTCAGAC CCCGTAGAAA AGATCAAAGG ATCTTCTTGA GATCCTTTTT TTCTGCGCGT
1801 AATCTGCTGC TTGCAAACAA AAAAACCACC GCTACCAGCG GTGGTTTGTT TGCCGGATCA
1861 AGAGCTACCA ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC AGAGCGCAGA TACCAAATAC
1921 TGTTCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG AACTCTGTAG CACCGCCTAC
1981 ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC AGTGGCGATA AGTCGTGTCT
2041 TACCGGGTTG GACTCAAGAC GATAGTTACC GGATAAGGCG CAGCGGTCGG GCTGAACGGG
2101 GGGTTCGTGC ACACAGCCCA GCTTGGAGCG AACGACCTAC ACCGAACTGA GATACCTACA
2161 GCGTGAGCTA TGAGAAAGCG CCACGCTTCC CGAAGGGAGA AAGGCGGACA GGTATCCGGT
2221 AAGCGGCAGG GTCGGAACAG GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA
2281 TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG CGTCGATTTT TGTGATGCTC
2341 GTCAGGGGGG CGGAGCCTAT GGAAAAACGC CAGCAACGCG GCCTTTTTAC GGTTCCTGGC
2401 CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TCCTGCGTTA TCCCCTGATT CTGTGGATAA
2461 CCGTATTACC GCTAGCATGG ATCTCGGGGA CGTCTAACTA CTAAGCGAGA GTAGGGAACT
2521 GCCAGGCATC AAATAAAACG AAAGGCTCAG TCGGAAGACT GGGCCTTTCG TTTTATCTGT
2581 TGTTTGTCGG TGAACGCTCT CCTGAGTAGG ACAAATCCGC CGGGAGCGGA TTTGAACGTT
2641 GTGAAGCAAC GGCCCGGAGG GTGGCGGGCA GGACGCCCGC CATAAACTGC CAGGCATCAA
2701 ACTAAGCAGA AGGCCATC



Figure: Cloning Sites of the Entry Vector pENTR3C (reading frame C)

Sites, left to right: Int, attL1, DraI, XmnI, SalI, BamHI, KpnI, EcoRI, PvuI

TTG TAC AAA AAA GCA GGC TCT TTA AAG GAA CCA ATT CAG TCG ACT GGA TCC GGT ACC GAA TTC GAT CGC --
AAC ATG TTT TTT CGT CCG AGA AAT TTC CTT GGT TAA GTC AGC TGA CCT AGG CCA TGG CTT AAG CTA GCG --
Leu Tyr Lys Lys Ala Gly Ser Leu Lys Glu Pro Ile Gln Ser Thr Gly Ser Gly Thr Glu Phe

-- ccdB --

Sites, left to right: EcoRI, NotI, XhoI, EcoRV, XbaI, attL2, Int

G AAT TCG CGG CCG CAC TCG AGA TAT CTA GAC CCA GCT TTC TTG TAC AAA G
C TTA AGC GCC GGC GTG AGC TCT ATA GAT CTG GGT CGA AAG AAC ATG TTT C
  Asn Ser Arg Pro His Ser Arg Tyr Leu Asp Pro Ala Phe Leu Tyr Lys



pENTR3C 2723 bp

Location (Base Nos.)    Gene Encoded
67..166                 attL1
327..632                ccdB
661..760                attL2
883..1692               KmR
1797..2370              ori
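
The feature coordinates in the table above can be checked with a short script. The following is a minimal sketch, not part of the original disclosure; it assumes the positions are 1-based and inclusive at both ends, and the names `feature_length`, `extract_feature` and `check_features` are hypothetical helpers introduced only for illustration.

```python
# Coordinates copied from the pENTR3C feature table.
# Assumption: positions are 1-based and inclusive at both ends.
PENTR3C_LENGTH = 2723
PENTR3C_FEATURES = {
    "attL1": (67, 166),
    "ccdB": (327, 632),
    "attL2": (661, 760),
    "KmR": (883, 1692),
    "ori": (1797, 2370),
}


def feature_length(start, end):
    """Length in base pairs of a 1-based, inclusive interval."""
    return end - start + 1


def extract_feature(sequence, start, end):
    """Return the bases of a 1-based, inclusive interval from a 0-based string."""
    return sequence[start - 1:end]


def check_features(features, total_length):
    """Verify that intervals are ordered, non-overlapping, and inside the vector."""
    previous_end = 0
    for name, (start, end) in sorted(features.items(), key=lambda item: item[1][0]):
        if not (previous_end < start <= end <= total_length):
            raise ValueError(f"feature {name} has inconsistent coordinates")
        previous_end = end


check_features(PENTR3C_FEATURES, PENTR3C_LENGTH)
for name, (start, end) in PENTR3C_FEATURES.items():
    print(f"{name}: {feature_length(start, end)} bp")
```

Under the stated assumption, the ccdB interval spans 306 bp, a whole number of codons, and each attL interval spans 100 bp.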



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTCTTT AAAGGAACCA
181 ATTCAGTCGA CTGGATCCGG TACCGAATTC GATCGCTTAC TAAAAGCCAG ATAACAGTAT
241 GCGTATTTGC GCGCTGATTT TTGCGGTATA AGAATATATA CTGATATGTA TACCCGAAGT
301 ATGTCAAAAA GAGGTGTGCT TCTAGAATGC AGTTTAAGGT TTACACCTAT AAAAGAGAGA
361 GCCGTTATCG TCTGTTTGTG GATGTACAGA GTGATATTAT TGACACGCCC GGGCGACGGA
421 TGGTGATCCC CCTGGCCAGT GCACGTCTGC TGTCAGATAA AGTCTCCCGT GAACTTTACC
481 CGGTGGTGCA TATCGGGGAT GAAAGCTGGC GCATGATGAC CACCGATATG GCCAGTGTGC
541 CGGTCTCCGT TATCGGGGAA GAAGTGGCTG ATCTCAGCCA CCGCGAAAAT GACATCAAAA
601 ACGCCATTAA CCTGATGTTC TGGGGAATAT AGAATTCGCG GCCGCACTCG AGATATCTAG
661 ACCCAGCTTT CTTGTACAAA GTTGGCATTA TAAGAAAGCA TTGCTTATCA ATTTGTTGCA
721 ACGAACAGGT CACTATCAGT CAAAATAAAA TCATTATTTG CCATCCAGCT GCAGCTCTGG
781 CCCGTGTCTC AAAATCTCTG ATGTTACATT GCACAAGATA AAAATATATC ATCATGAACA
841 ATAAAACTGT CTGCTTACAT AAACAGTAAT ACAAGGGGTG TTATGAGCCA TATTCAACGG
901 GAAACGTCGA GGCCGCGATT AAATTCCAAC ATGGATGCTG ATTTATATGG GTATAAATGG
961 GCTCGCGATA ATGTCGGGCA ATCAGGTGCG ACAATCTATC GCTTGTATGG GAAGCCCGAT
1021 GCGCCAGAGT TGTTTCTGAA ACATGGCAAA GGTAGCGTTG CCAATGATGT TACAGATGAG
1081 ATGGTCAGAC TAAACTGGCT GACGGAATTT ATGCCTCTTC CGACCATCAA GCATTTTATC
1141 CGTACTCCTG ATGATGCATG GTTACTCACC ACTGCGATCC CCGGAAAAAC AGCATTCCAG
1201 GTATTAGAAG AATATCCTGA TTCAGGTGAA AATATTGTTG ATGCGCTGGC AGTGTTCCTG
1261 CGCCGGTTGC ATTCGATTCC TGTTTGTAAT TGTCCTTTTA ACAGCGATCG CGTATTTCGT
1321 CTCGCTCAGG CGCAATCACG AATGAATAAC GGTTTGGTTG ATGCGAGTGA TTTTGATGAC
1381 GAGCGTAATG GCTGGCCTGT TGAACAAGTC TGGAAAGAAA TGCATAAACT TTTGCCATTC
1441 TCACCGGATT CAGTCGTCAC TCATGGTGAT TTCTCACTTG ATAACCTTAT TTTTGACGAG
1501 GGGAAATTAA TAGGTTGTAT TGATGTTGGA CGAGTCGGAA TCGCAGACCG ATACCAGGAT
1561 CTTGCCATCC TATGGAACTG CCTCGGTGAG TTTTCTCCTT CATTACAGAA ACGGCTTTTT
1621 CAAAAATATG GTATTGATAA TCCTGATATG AATAAATTGC AGTTTCATTT GATGCTCGAT
1681 GAGTTTTTCT AATCAGAATT GGTTAATTGG TTGTAACATT ATTCAGATTG GGCCCCGTTC
1741 CACTGAGCGT CAGACCCCGT AGAAAAGATC AAAGGATCTT CTTGAGATCC TTTTTTTCTG
1801 CGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG
1861 GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA
1921 AATACTGTTC TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG
1981 CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG
2041 TGTCTTACCG GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA
2101 ACGGGGGGTT CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC
2161 CTACAGCGTG AGCTATGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT
2221 CCGGTAAGCG GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC
2281 TGGTATCTTT ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA
2341 TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC
2401 CTGGCCTTTT GCTGGCCTTT TGCTCACATG TTCTTTCCTG CGTTATCCCC TGATTCTGTG
2461 GATAACCGTA TTACCGCTAG CATGGATCTC GGGGACGTCT AACTACTAAG CGAGAGTAGG
2521 GAACTGCCAG GCATCAAATA AAACGAAAGG CTCAGTCGGA AGACTGGGCC TTTCGTTTTA
2581 TCTGTTGTTT GTCGGTGAAC GCTCTCCTGA GTAGGACAAA TCCGCCGGGA GCGGATTTGA
2641 ACGTTGTGAA GCAACGGCCC GGAGGGTGGC GGGCAGGACG CCCGCCATAA ACTGCCAGGC
2701 ATCAAACTAA GCAGAAGGCC ATC
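
Sequence listings of this kind are easily damaged in transcription, for example by stray spaces inside the position number or inside a ten-base group. The following is a minimal sketch, not part of the original disclosure, of how one listing line might be normalized; the helper names `clean_sequence_line` and `regroup` are hypothetical, and the sketch assumes that every line starts with a position number followed by bases written only as A, C, G and T.

```python
import re


def clean_sequence_line(line):
    """Split one listing line into (start_position, bases), dropping stray spaces."""
    first_base = re.search(r"[ACGTacgt]", line)
    if first_base is None:
        raise ValueError("no bases found on this line")
    prefix = line[:first_base.start()]
    digits = "".join(ch for ch in prefix if ch.isdigit())
    if not digits:
        raise ValueError("no position number found on this line")
    # Keep only base letters; anything else (spaces, stray symbols) is discarded.
    bases = "".join(ch for ch in line[first_base.start():].upper() if ch in "ACGT")
    return int(digits), bases


def regroup(position, bases, group=10):
    """Format bases as space-separated groups after the position number."""
    groups = [bases[i:i + group] for i in range(0, len(bases), group)]
    return f"{position} " + " ".join(groups)


raw = "3 01 ATGTCAAAAA GAGGTGTGCT TCTAGAATGC AGTTTAAGGT TT AC AC C TAT AAAAGAGAGA"
position, bases = clean_sequence_line(raw)
print(regroup(position, bases))
```

Because unrecognized characters are discarded silently, a full line that yields fewer than 60 bases is a sign that a base was misread and should be checked against the original figure.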
Figure 154: Cloning Sites of the Entry Vector pENTR4

Sites, left to right: Int, attL1, NcoI, Kozak, XmnI, SalI, BamHI, KpnI, EcoRI

TTG TAC AAA AAA GCA GGC TCC ACC ATG GGA ACC AAT TCA GTC GAC TGG ATC CGG TAC CGA ATT C --
AAC ATG TTT TTT CGT CCG AGG TGG TAC CCT TGG TTA AGT CAG CTG ACC TAG GCC ATG GCT TAA G --
Leu Tyr Lys Lys Ala Gly Ser Thr Met Gly Thr Asn Ser Val Asp Trp Ile Arg Tyr Arg Ile

-- ccdB --

Sites, left to right: EcoRI, NotI, XhoI, EcoRV, XbaI, Int, attL2

G AAT TCG CGG CCG CAC TCG AGA TAT CTA GAC CCA GCT TTC TTG TAC AAA G
C TTA AGC GCC GGC GTG AGC TCT ATA GAT CTG GGT CGA AAG AAC ATG TTT C
Phe Leu Tyr Lys (aligned with TTC TTG TAC AAA)



pENTR4 2720 bp

Location (Base Nos.)    Gene Encoded
67..166                 attL1
324..629                ccdB
658..757                attL2
880..1689               KmR
1794..2367              ori



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTCCAC CATGGGAACC
181 AATTCAGTCG ACTGGATCCG GTACCGAATT CGCTTACTAA AAGCCAGATA ACAGTATGCG
241 TATTTGCGCG CTGATTTTTG CGGTATAAGA ATATATACTG ATATGTATAC CCGAAGTATG
301 TCAAAAAGAG GTGTGCTTCT AGAATGCAGT TTAAGGTTTA CACCTATAAA AGAGAGAGCC
361 GTTATCGTCT GTTTGTGGAT GTACAGAGTG ATATTATTGA CACGCCCGGG CGACGGATGG
421 TGATCCCCCT GGCCAGTGCA CGTCTGCTGT CAGATAAAGT CTCCCGTGAA CTTTACCCGG
481 TGGTGCATAT CGGGGATGAA AGCTGGCGCA TGATGACCAC CGATATGGCC AGTGTGCCGG
541 TCTCCGTTAT CGGGGAAGAA GTGGCTGATC TCAGCCACCG CGAAAATGAC ATCAAAAACG
601 CCATTAACCT GATGTTCTGG GGAATATAGA ATTCGCGGCC GCACTCGAGA TATCTAGACC
661 CAGCTTTCTT GTACAAAGTT GGCATTATAA GAAAGCATTG CTTATCAATT TGTTGCAACG
721 AACAGGTCAC TATCAGTCAA AATAAAATCA TTATTTGCCA TCCAGCTGCA GCTCTGGCCC
781 GTGTCTCAAA ATCTCTGATG TTACATTGCA CAAGATAAAA ATATATCATC ATGAACAATA
841 AAACTGTCTG CTTACATAAA CAGTAATACA AGGGGTGTTA TGAGCCATAT TCAACGGGAA
901 ACGTCGAGGC CGCGATTAAA TTCCAACATG GATGCTGATT TATATGGGTA TAAATGGGCT
961 CGCGATAATG TCGGGCAATC AGGTGCGACA ATCTATCGCT TGTATGGGAA GCCCGATGCG
1021 CCAGAGTTGT TTCTGAAACA TGGCAAAGGT AGCGTTGCCA ATGATGTTAC AGATGAGATG
1081 GTCAGACTAA ACTGGCTGAC GGAATTTATG CCTCTTCCGA CCATCAAGCA TTTTATCCGT
1141 ACTCCTGGTG ATGCATGGTT ACTCACCACT GCGATCCCCG GAAAAACAGC ATTCCAGGTA
1201 TTAGAAGAAT ATCCTGATTC AGGTGAAAAT ATTGTTGATG CGCTGGCAGT GTTCCTGCGC
1261 CGGTTGCATT CGATTCCTGT TTGTAATTGT CCTTTTAACA GCGATCGCGT ATTTCGTCTC
1321 GCTCAGGCGC AATCACGAAT GAATAACGGT TTGGTTGATG CGAGTGATTT TGATGACGAG
1381 CGTAATGGCT GGCCTGTTGA ACAAGTCTGG AAAGAAATGC ATAAACTTTT GCCATTCTCA
1441 CCGGATTCAG TCGTCACTCA TGGTGATTTC TCACTTGATA ACCTTATTTT TGACGAGGGG
1501 AAATTAATAG GTTGTATTGA TGTTGGACGA GTCGGAATCG CAGACCGATA CCAGGATCTT
1561 GCCATCCTAT GGAACTGCCT CGGTGAGTTT TCTCCTTCAT TACAGAAACG GCTTTTTCAA
1621 AAATATGGTA TTGATAATCC TGATATGAAT AAATTGCAGT TTCATTTGAT GCTCGATGAG
1681 TTTTTCTAAT CAGAATTGGT TAATTGGTTG TAACATTATT CAGATTGGGC CCCGTTCCAC
1741 TGAGCGTCAG ACCCCGTAGA AAAGATCAAA GGATCTTCTT GAGATCCTTT TTTTCTGCGC
1801 GTAATCTGCT GCTTGCAAAC AAAAAAACCA CCGCTACCAG CGGTGGTTTG TTTGCCGGAT
1861 CAAGAGCTAC CAACTCTTTT TCCGAAGGTA ACTGGCTTCA GCAGAGCGCA GATACCAAAT
1921 ACTGTTCTTC TAGTGTAGCC GTAGTTAGGC CACCACTTCA AGAACTCTGT AGCACCGCCT
1981 ACATACCTCG CTCTGCTAAT CCTGTTACCA GTGGCTGCTG CCAGTGGCGA TAAGTCGTGT
2041 CTTACCGGGT TGGACTCAAG ACGATAGTTA CCGGATAAGG CGCAGCGGTC GGGCTGAACG
2101 GGGGGTTCGT GCACACAGCC CAGCTTGGAG CGAACGACCT ACACCGAACT GAGATACCTA
2161 CAGCGTGAGC TATGAGAAAG CGCCACGCTT CCCGAAGGGA GAAAGGCGGA CAGGTATCCG
2221 GTAAGCGGCA GGGTCGGAAC AGGAGAGCGC ACGAGGGAGC TTCCAGGGGG AAACGCCTGG
2281 TATCTTTATA GTCCTGTCGG GTTTCGCCAC CTCTGACTTG AGCGTCGATT TTTGTGATGC
2341 TCGTCAGGGG GGCGGAGCCT ATGGAAAAAC GCCAGCAACG CGGCCTTTTT ACGGTTCCTG
2401 GCCTTTTGCT GGCCTTTTGC TCACATGTTC TTTCCTGCGT TATCCCCTGA TTCTGTGGAT
2461 AACCGTATTA CCGCTAGCAT GGATCTCGGG GACGTCTAAC TACTAAGCGA GAGTAGGGAA
2521 CTGCCAGGCA TCAAATAAAA CGAAAGGCTC AGTCGGAAGA CTGGGCCTTT CGTTTTATCT
2581 GTTGTTTGTC GGTGAACGCT CTCCTGAGTA GGACAAATCC GCCGGGAGCG GATTTGAACG
2641 TTGTGAAGCA ACGGCCCGGA GGGTGGCGGG CAGGACGCCC GCCATAAACT GCCAGGCATC
2701 AAACTAAGCA GAAGGCCATC



Figure: Cloning sites of the Entry Vector pENTR5

ttg tac aaa aaa gca ggc ttt cat atg gga acc aat tca gtc gac tgg atc cgg tac cga att cgc --
aac atg ttt ttt cgt ccg aaa gta tac cct tgg tta agt cag ctg acc tag gcc atg gct taa gcg --
Leu Tyr Lys Lys Ala Gly Phe His Met Gly Thr Asn Ser Val Asp Trp Ile Arg Tyr Arg Ile

-- Death (ccdB) --

Sites, left to right: NotI, XhoI, EcoRV, XbaI

aga att cgc ggc cgc act cga gat atc tag acc cag ctt tct tgt aca aag
tct taa gcg ccg gcg tga gct cta tag atc tgg gtc gaa aga aca tgt ttc



pENTR5 2720 bp

Location (Base Nos.)    Gene Encoded
67..166                 attL1
324..629                ccdB
658..757                attL2
880..1689               KmR
1794..2367              ori

1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTTCA TATGGGAACC
181 AATTCAGTCG ACTGGATCCG GTACCGAATT CGCTTACTAA AAGCCAGATA ACAGTATGCG
241 TATTTGCGCG CTGATTTTTG CGGTATAAGA ATATATACTG ATATGTATAC CCGAAGTATG
301 TCAAAAAGAG GTGTGCTTCT AGAATGCAGT TTAAGGTTTA CACCTATAAA AGAGAGAGCC
361 GTTATCGTCT GTTTGTGGAT GTACAGAGTG ATATTATTGA CACGCCCGGG CGACGGATGG
421 TGATCCCCCT GGCCAGTGCA CGTCTGCTGT CAGATAAAGT CTCCCGTGAA CTTTACCCGG
481 TGGTGCATAT CGGGGATGAA AGCTGGCGCA TGATGACCAC CGATATGGCC AGTGTGCCGG
541 TCTCCGTTAT CGGGGAAGAA GTGGCTGATC TCAGCCACCG CGAAAATGAC ATCAAAAACG
601 CCATTAACCT GATGTTCTGG GGAATATAGA ATTCGCGGCC GCACTCGAGA TATCTAGACC
661 CAGCTTTCTT GTACAAAGTT GGCATTATAA GAAAGCATTG CTTATCAATT TGTTGCAACG
721 AACAGGTCAC TATCAGTCAA AATAAAATCA TTATTTGCCA TCCAGCTGCA GCTCTGGCCC
781 GTGTCTCAAA ATCTCTGATG TTACATTGCA CAAGATAAAA ATATATCATC ATGAACAATA
841 AAACTGTCTG CTTACATAAA CAGTAATACA AGGGGTGTTA TGAGCCATAT TCAACGGGAA
901 ACGTCGAGGC CGCGATTAAA TTCCAACATG GATGCTGATT TATATGGGTA TAAATGGGCT
961 CGCGATAATG TCGGGCAATC AGGTGCGACA ATCTATCGCT TGTATGGGAA GCCCGATGCG
1021 CCAGAGTTGT TTCTGAAACA TGGCAAAGGT AGCGTTGCCA ATGATGTTAC AGATGAGATG
1081 GTCAGACTAA ACTGGCTGAC GGAATTTATG CCTCTTCCGA CCATCAAGCA TTTTATCCGT
1141 ACTCCTGATG ATGCATGGTT ACTCACCACT GCGATCCCCG GAAAAACAGC ATTCCAGGTA
1201 TTAGAAGAAT ATCCTGATTC AGGTGAAAAT ATTGTTGATG CGCTGGCAGT GTTCCTGCGC
1261 CGGTTGCATT CGATTCCTGT TTGTAATTGT CCTTTTAACA GCGATCGCGT ATTTCGTCTC
1321 GCTCAGGCGC AATCACGAAT GAATAACGGT TTGGTTGATG CGAGTGATTT TGATGACGAG
1381 CGTAATGGCT GGCCTGTTGA ACAAGTCTGG AAAGAAATGC ATAAACTTTT GCCATTCTCA
1441 CCGGATTCAG TCGTCACTCA TGGTGATTTC TCACTTGATA ACCTTATTTT TGACGAGGGG
1501 AAATTAATAG GTTGTATTGA TGTTGGACGA GTCGGAATCG CAGACCGATA CCAGGATCTT
1561 GCCATCCTAT GGAACTGCCT CGGTGAGTTT TCTCCTTCAT TACAGAAACG GCTTTTTCAA
1621 AAATATGGTA TTGATAATCC TGATATGAAT AAATTGCAGT TTCATTTGAT GCTCGATGAG
1681 TTTTTCTAAT CAGAATTGGT TAATTGGTTG TAACATTATT CAGATTGGGC CCCGTTCCAC
1741 TGAGCGTCAG ACCCCGTAGA AAAGATCAAA GGATCTTCTT GAGATCCTTT TTTTCTGCGC
1801 GTAATCTGCT GCTTGCAAAC AAAAAAACCA CCGCTACCAG CGGTGGTTTG TTTGCCGGAT
1861 CAAGAGCTAC CAACTCTTTT TCCGAAGGTA ACTGGCTTCA GCAGAGCGCA GATACCAAAT
1921 ACTGTTCTTC TAGTGTAGCC GTAGTTAGGC CACCACTTCA AGAACTCTGT AGCACCGCCT
1981 ACATACCTCG CTCTGCTAAT CCTGTTACCA GTGGCTGCTG CCAGTGGCGA TAAGTCGTGT
2041 CTTACCGGGT TGGACTCAAG ACGATAGTTA CCGGATAAGG CGCAGCGGTC GGGCTGAACG
2101 GGGGGTTCGT GCACACAGCC CAGCTTGGAG CGAACGACCT ACACCGAACT GAGATACCTA
2161 CAGCGTGAGC TATGAGAAAG CGCCACGCTT CCCGAAGGGA GAAAGGCGGA CAGGTATCCG
2221 GTAAGCGGCA GGGTCGGAAC AGGAGAGCGC ACGAGGGAGC TTCCAGGGGG AAACGCCTGG
2281 TATCTTTATA GTCCTGTCGG GTTTCGCCAC CTCTGACTTG AGCGTCGATT TTTGTGATGC
2341 TCGTCAGGGG GGCGGAGCCT ATGGAAAAAC GCCAGCAACG CGGCCTTTTT ACGGTTCCTG
2401 GCCTTTTGCT GGCCTTTTGC TCACATGTTC TTTCCTGCGT TATCCCCTGA TTCTGTGGAT
2461 AACCGTATTA CCGCTAGCAT GGATCTCGGG GACGTCTAAC TACTAAGCGA GAGTAGGGAA
2521 CTGCCAGGCA TCGAATAAAA CGAAAGGCTC AGTCGGAAGA CTGGGCCTTT CGTTTTATCT
2581 GTTGTTTGTC GGTGAACGCT CTCCTGAGTA GGACAAATCC GCCGGGAGCG GATTTGAACG
2641 TTGTGAAGCA ACGGCCCGGA GGGTGGCGGG CAGGACGCCC GCCATAAACT GCCAGGCATC
2701 AAACTAAGCA GAAGGCCATC
Figure: Cloning sites of the Entry Vector pENTR6

ttg tac aaa aaa gca ggc tgc atg cga acc aat tca gtc
aac atg ttt ttt cgt ccg acg tac gct tgg tta agt cag
Leu Tyr Lys Lys Ala Gly Cys Met Arg Thr Asn Ser Val
pENTR6 2717 bp

Location (Base Nos.)    Gene Encoded
67..166                 attL1
321..626                ccdB
655..754                attL2
877..1686               KmR
1791..2364              ori

1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTGCAT GCGAACCAAT
181 TCAGTCGACT GGATCCGGTA CCGAATTCGC TTACTAAAAG CCAGATAACA GTATGCGTAT
241 TTGCGCGCTG ATTTTTGCGG TATAAGAATA TATACTGATA TGTATACCCG AAGTATGTCA
301 AAAAGAGGTG TGCTTCTAGA ATGCAGTTTA AGGTTTACAC CTATAAAAGA GAGAGCCGTT
361 ATCGTCTGTT TGTGGATGTA CAGAGTGATA TTATTGACAC GCCCGGGCGA CGGATGGTGA
421 TCCCCCTGGC CAGTGCACGT CTGCTGTCAG ATAAAGTCTC CCGTGAACTT TACCCGGTGG
481 TGCATATCGG GGATGAAAGC TGGCGCATGA TGACCACCGA TATGGCCAGT GTGCCGGTCT
541 CCGTTATCGG GGAAGAAGTG GCTGATCTCA GCCACCGCGA AAATGACATC AAAAACGCCA
601 TTAACCTGAT GTTCTGGGGA ATATAGAATT CGCGGCCGCA CTCGAGATAT CTAGACCCAG
661 CTTTCTTGTA CAAAGTTGGC ATTATAAGAA AGCATTGCTT ATCAATTTGT TGCAACGAAC
721 AGGTCACTAT CAGTCAAAAT AAAATCATTA TTTGCCATCC AGCTGCAGCT CTGGCCCGTG
781 TCTCAAAATC TCTGATGTTA CATTGCACAA GATAAAAATA TATCATCATG AACAATAAAA
841 CTGTCTGCTT ACATAAACAG TAATACAAGG GGTGTTATGA GCCATATTCA ACGGGAAACG
901 TCGAGGCCGC GATTAAATTC CAACATGGAT GCTGATTTAT ATGGGTATAA ATGGGCTCGC
961 GATAATGTCG GGCAATCAGG TGCGACAATC TATCGCTTGT ATGGGAAGCC CGATGCGCCA
1021 GAGTTGTTTC TGAAACATGG CAAAGGTAGC GTTGCCAATG ATGTTACAGA TGAGATGGTC
1081 AGACTAAACT GGCTGACGGA ATTTATGCCT CTTCCGACCA TCAAGCATTT TATCCGTACT
1141 CCTGATGATG CATGGTTACT CACCACTGCG ATCCCCGGAA AAACAGCATT CCAGGTATTA
1201 GAAGAATATC CTGATTCAGG TGAAAATATT GTTGATGCGC TGGCAGTGTT CCTGCGCCGG
1261 TTGCATTCGA TTCCTGTTTG TAATTGTCCT TTTAACAGCG ATCGCGTATT TCGTCTCGCT
1321 CAGGCGCAAT CACGAATGAA TAACGGTTTG GTTGATGCGA GTGATTTTGA TGACGAGCGT
1381 AATGGCTGGC CTGTTGAACA AGTCTGGAAA GAAATGCATA AACTTTTGCC ATTCTCACCG
1441 GATTCAGTCG TCACTCATGG TGATTTCTCA CTTGATAACC TTATTTTTGA CGAGGGGAAA
1501 TTAATAGGTT GTATTGATGT TGGACGAGTC GGAATCGCAG ACCGATACCA GGATCTTGCC
1561 ATCCTATGGA ACTGCCTCGG TGAGTTTTCT CCTTCATTAC AGAAACGGCT TTTTCAAAAA
1621 TATGGTATTG ATAATCCTGA TATGAATAAA TTGCAGTTTC ATTTGATGCT CGATGAGTTT
1681 TTCTAATCAG AATTGGTTAA TTGGTTGTAA CATTATTCAG ATTGGGCCCC GTTCCACTGA
1741 GCGTCAGACC CCGTAGAAAA GATCAAAGGA TCTTCTTGAG ATCCTTTTTT TCTGCGCGTA
1801 ATCTGCTGCT TGCAAACAAA AAAACCACCG CTACCAGCGG TGGTTTGTTT GCCGGATCAA
1861 GAGCTACCAA CTCTTTTTCC GAAGGTAACT GGCTTCAGCA GAGCGCAGAT ACCAAATACT
1921 GTTCTTCTAG TGTAGCCGTA GTTAGGCCAC CACTTCAAGA ACTCTGTAGC ACCGCCTACA
1981 TACCTCGCTC TGCTAATCCT GTTACCAGTG GCTGCTGCCA GTGGCGATAA GTCGTGTCTT
2041 ACCGGGTTGG ACTCAAGACG ATAGTTACCG GATAAGGCGC AGCGGTCGGG CTGAACGGGG
2101 GGTTCGTGCA CACAGCCCAG CTTGGAGCGA ACGACCTACA CCGAACTGAG ATACCTACAG
2161 CGTGAGCTAT GAGAAAGCGC CACGCTTCCC GAAGGGAGAA AGGCGGACAG GTATCCGGTA
2221 AGCGGCAGGG TCGGAACAGG AGAGCGCACG AGGGAGCTTC CAGGGGGAAA CGCCTGGTAT
2281 CTTTATAGTC CTGTCGGGTT TCGCCACCTC TGACTTGAGC GTCGATTTTT GTGATGCTCG
2341 TCAGGGGGGC GGAGCCTATG GAAAAACGCC AGCAACGCGG CCTTTTTACG GTTCCTGGCC
2401 TTTTGCTGGC CTTTTGCTCA CATGTTCTTT CCTGCGTTAT CCCCTGATTC TGTGGATAAC
2461 CGTATTACCG CTAGCATGGA TCTCGGGGAC GTCTAACTAC TAAGCGAGAG TAGGGAACTG
2521 CCAGGCATCA AATAAAACGA AAGGCTCAGT CGGAAGACTG GGCCTTTCGT TTTATCTGTT
2581 GTTTGTCGGT GAACGCTCTC CTGAGTAGGA CAAATCCGCC GGGAGCGGAT TTGAACGTTG
2641 TGAAGCAACG GCCCGGAGGG TGGCGGGCAG GACGCCCGCC ATAAACTGCC AGGCATCAAA
2701 CTAAGCAGAA GGCCATC
Figure: Cloning sites of the Entry Vector pENTR7

pENTR7 2738 bp

Location (Base Nos.)    Gene Encoded
67..166                 attL1
342..647                ccdB
676..775                attL2
898..1707               KmR
1812..2385              ori



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTTGA AAACCTGTAT
181 TTTCAAGGAA CCGTTTCATG CATCGTCGAC TGGATCCGGT ACCGAATTCG CTTACTAAAA
241 GCCAGATAAC AGTATGCGTA TTTGCGCGCT GATTTTTGCG GTATAAGAAT ATATACTGAT
301 ATGTATACCC GAAGTATGTC AAAAAGAGGT GTGCTTCTAG AATGCAGTTT AAGGTTTACA
361 CCTATAAAAG AGAGAGCCGT TATCGTCTGT TTGTGGATGT ACAGAGTGAT ATTATTGACA
421 CGCCCGGGCG ACGGATAGTG ATCCCCCTGG CCAGTGCACG TCTGCTGTCA GATAAAGTCT
481 CCCGTGAACT TTACCCGGTG GTGCATATCG GGGATGAAAG CTGGCGCATG ATGACCACCG
541 ATATGGCCAG TGTGCCGGTC TCCGTTATCG GGGAAGAAGT GGCTGATCTC AGCCACCGCG
601 AAAATGACAT CAAAAACGCC ATTAACCTGA TGTTCTGGGG AATATAGAAT TCGCGGCCGC
661 ACTCGAGATA TCTAGACCCA GCTTTCTTGT ACAAAGTTGG CATTATAAGA AAGCATTGCT
721 TATCAATTTG TTGCAACGAA CAGGTCACTA TCAGTCAAAA TAAAATCATT ATTTGCCATC
781 CAGCTGCAGC TCTGGCCCGT GTCTCAAAAT CTCTGATGTT ACATTGCACA AGATAAAAAT
841 ATATCATCAT GAACAATAAA ACTGTCTGCT TACATAAACA GTAATACAAG GGGTGTTATG
901 AGCCATATTC AACGGGAAAC GTCGAGGCCG CGATTAAATT CCAACATGGA TGCTGATTTA
961 TATGGGTATA AATGGGCTCG CGATAATGTC GGGCAATCAG GTGCGACAAT CTATCGCTTG
1021 TATGGGAAGC CCGATGCGCC AGAGTTGTTT CTGAAACATG GCAAAGGTAG CGTTGCCAAT
1081 GATGTTACAG ATGAGATGGT CAGACTAAAC TGGCTGACGG AATTTATGCC TCTTCCGACC
1141 ATCAAGCATT TTATCCGTAC TCCTGATGAT GCATGGTTAC TCACCACTGC GATCCCCGGA
1201 AAAACAGCAT TCCAGGTATT AGAAGAATAT CCTGATTCAG GTGAAAATAT TGTTGATGCG
1261 CTGGCAGTGT TCCTGCGCCG GTTGCATTCG ATTCCTGTTT GTAATTGTCC TTTTAACAGC
1321 GATCGCGTAT TTCGTCTCGC TCAGGCGCAA TCACGAATGA ATAACGGTTT GGTTGATGCG
1381 AGTGATTTTG ATGACGAGCG TAATGGCTGG CCTGTTGAAC AAGTCTGGAA AGAAATGCAT
1441 AAACTTTTGC CATTCTCACC GGATTCAGTC GTCACTCATG GTGATTTCTC ACTTGATAAC
1501 CTTATTTTTG ACGAGGGGAA ATTAATAGGT TGTATTGATG TTGGACGAGT CGGAATCGCA
1561 GACCGATACC AGGATCTTGC CATCCTATGG AACTGCCTCG GTGAGTTTTC TCCTTCATTA
1621 CAGAAACGGC TTTTTCAAAA ATATGGTATT GATAATCCTG ATATGAATAA ATTGCAGTTT
1681 CATTTGATGC TCGATGAGTT TTTCTAATCA GAATTGGTTA ATTGGTTGTA ACATTATTCA
1741 GATTGGGCCC CGTTCCACTG AGCGTCAGAC CCCGTAGAAA AGATCAAAGG ATCTTCTTGA
1801 GATCCTTTTT TTCTGCGCGT AATCTGCTGC TTGCAAACAA AAAAACCACC GCTACCAGCG
1861 GTGGTTTGTT TGCCGGATCA AGAGCTACCA ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC
1921 AGAGCGCAGA TACCAAATAC TGTTCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG
1981 AACTCTGTAG CACCGCCTAC ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC
2041 AGTGGCGATA AGTCGTGTCT TACCGGGTTG GACTCAAGAC GATAGTTACC GGATAAGGCG
2101 CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA GCTTGGAGCG AACGACCTAC
2161 ACCGAACTGA GATACCTACA GCGTGAGCTA TGAGAAAGCG CCACGCTTCC CGAAGGGAGA
2221 AAGGCGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG GAGAGCGCAC GAGGGAGCTT
2281 CCAGGGGGAA ACGCCTGGTA TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG
2341 CGTCGATTTT TGTGATGCTC GTCAGGGGGG CGGAGCCTAT GGAAAAACGC CAGCAACGCG
2401 GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TCCTGCGTTA
2461 TCCCCTGATT CTGTGGATAA CCGTATTACC GCTAGCATGG ATCTCGGGGA CGTCTAACTA
2521 CTAAGCGAGA GTAGGGAACT GCCAGGCATC AAATAAAACG AAAGGCTCAG TCGGAAGACT
2581 GGGCCTTTCG TTTTATCTGT TGTTTGTCGG TGAACGCTCT CCTGAGTAGG ACAAATCCGC
2641 CGGGAGCGGA TTTGAACGTT GTGAAGCAAC GGCCCGGAGG GTGGCGGGCA GGACGCCCGC
2701 CATAAACTGC CAGGCATCAA ACTAAGCAGA AGGCCATC
Figure: Cloning Sites of the Entry Vector pENTR8

pENTR8 2735 bp

Location (Base Nos.)    Gene Encoded
67..166                 attL1
339..644                ccdB
673..772                attL2
895..1704               KmR
1809..2382              ori



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTTGA AAACCTGTAT
181 TTTCAAGGAA CCATGGACCT AGTCGACTGG ATCCGGTACC GAATTCGCTT ACTAAAAGCC
241 AGATAACAGT ATGCGTATTT GCGCGCTGAT TTTTGCGGTA TAAGAATATA TACTGATATG
301 TATACCCGAA GTATGTCAAA AAGAGGTGTG CTTCTAGAAT GCAGTTTAAG GTTTACACCT
361 ATAAAAGAGA GAGCCGTTAT CGTCTGTTTG TGGATGTACA GAGTGATATT ATTGACACGC
421 CCGGGCGACG GATAGTGATC CCCCTGGCCA GTGCACGTCT GCTGTCAGAT AAAGTCTCCC
481 GTGAACTTTA CCCGGTGGTG CATATCGGGG ATGAAAGCTG GCGCATGATG ACCACCGATA
541 TGGCCAGTGT GCCGGTCTCC GTTATCGGGG AAGAAGTGGC TGATCTCAGC CACCGCGAAA
601 ATGACATCAA AAACGCCATT AACCTGATGT TCTGGGGAAT ATAGAATTCG CGGCCGCACT
661 CGAGATATCT AGACCCAGCT TTCTTGTACA AAGTTGGCAT TATAAGAAAG CATTGCTTAT
721 CAATTTGTTG CAACGAACAG GTCACTATCA GTCAAAATAA AATCATTATT TGCCATCCAG
781 CTGCAGCTCT GGCCCGTGTC TCAAAATCTC TGATGTTACA TTGCACAAGA TAAAAATATA
841 TCATCATGAA CAATAAAACT GTCTGCTTAC ATAAACAGTA ATACAAGGGG TGTTATGAGC
901 CATATTCAAC GGGAAACGTC GAGGCCGCGA TTAAATTCCA ACATGGATGC TGATTTATAT
961 GGGTATAAAT GGGCTCGCGA TAATGTCGGG CAATCAGGTG CGACAATCTA TCGCTTGTAT
1021 GGGAAGCCCG ATGCGCCAGA GTTGTTTCTG AAACATGGCA AAGGTAGCGT TGCCAATGAT
1081 GTTACAGATG AGATGGTCAG ACTAAACTGG CTGACGGAAT TTATGCCTCT TCCGACCATC
1141 AAGCATTTTA TCCGTACTCC TGATGATGCA TGGTTACTCA CCACTGCGAT CCCCGGAAAA
1201 ACAGCATTCC AGGTATTAGA AGAATATCCT GATTCAGGTG AAAATATTGT TGATGCGCTG
1261 GCAGTGTCCC TGCGCCGGTT GCATTCGATT CCTGTTTGTA ATTGTCCTTT TAACAGCGAT
1321 CGCGTATTTC GTCTCGCTCA GGCGCAATCA CGAATGAATA ACGGTTTGGT TGATGCGAGT
1381 GATTTTGATG ACGAGCGTAA TGGCTGGCCT GTTGAACAAG TCTGGAAAGA AATGCATAAA
1441 CTTTTGCCAT TCTCACCGGA TTCAGTCGTC ACTCATGGTG ATTTCTCACT TGATAACCTT
1501 ATTTTTGACG AGGGGAAATT AATAGGTTGT ATTGATGTTG GACGAGTCGG AATCGCAGAC
1561 CGATACCAGG ATCTTGCCAT CCTATGGAAC TGCCTCGGTG AGTTTTCTCC TTCATTACAG
1621 AAACGGCTTT TTCAAAAATA TGGTATTGAT AATCCTGATA TGAATAAATT GCAGTTTCAT
1681 TTGATGCTCG ATGAGTTTTT CTAATCAGAA TTGGTTAATT GGTTGTAACA TTATTCAGAT
1741 TGGGCCCCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC TTCTTGAGAT
1801 CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT ACCAGCGGTG
1861 GTTTGTTTGC CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG CTTCAGCAGA
1921 GCGCAGATAC CAAATACTGT TCTTCTAGTG TAGCCGTAGT TAGGCCACCA CTTCAAGAAC
1981 TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC TGCTGCCAGT
2041 GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA TAAGGCGCAG
2101 CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC GACCTACACC
2161 GAACTGAGAT ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG
2221 GCGGACAGGT ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTCCA
2281 GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG ACTTGAGCGT
2341 CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG CAACGCGGCC
2401 TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC TGCGTTATCC
2461 CCTGATTCTG TGGATAACCG TATTACCGCT AGCATGGATC TCGGGGACGT CTAACTACTA
2521 AGCGAGAGTA GGGAACTGCC AGGCATCAAA TAAAACGAAA GGCTCAGTCG GAAGACTGGG
2581 CCTTTCGTTT TATCTGTTGT TTGTCGGTGA ACGCTCTCCT GAGTAGGACA AATCCGCCGG
2641 GAGCGGATTT GAACGTTGTG AAGCAACGGC CCGGAGGGTG GCGGGCAGGA CGCCCGCCAT
2701 AAACTGCCAG GCATCAAACT AAGCAGAAGG CCATC



Figure: Cloning sites of the Entry Vector pENTR9

ttg tac aaa aaa gca ggc ttt gaa aac ctg tat ttt caa gga
aac atg ttt ttt cgt ccg aaa ctt ttg gac ata aaa gtt cct
Leu Tyr Lys Lys Ala Gly Phe Glu Asn Leu Tyr Phe Gln Gly

Sites, left to right: NdeI, BglII, SalI, BamHI

cat atg aga tct gtc gac tgg atc cgg tac cga att cgc --
gta tac tct aga cag ctg acc tag gcc atg gct taa gcg --
His Met Arg Ser Val Asp Trp Ile Arg Tyr Arg Ile

-- Death (ccdB) --

aga att cgc ggc cgc act cga gat atc tag acc cag ctt tct tgt aca aag
tct taa gcg ccg gcg tga gct cta tag atc tgg gtc gaa aga aca tgt ttc



pENTR9 2735 bp

Location (Base Nos.)    Gene Encoded
67..166                 attL1
339..644                ccdB
673..772                attL2
895..1704               KmR
1809..2382              ori



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTTGA AAACCTGTAT
181 TTTCAAGGAC ATATGAGATC TGTCGACTGG ATCCGGTACC GAATTCGCTT ACTAAAAGCC
241 AGATAACAGT ATGCGTATTT GCGCGCTGAT TTTTGCGGTA TAAGAATATA TACTGATATG
301 TATACCCGAA GTATGTCAAA AAGAGGTGTG CTTCTAGAAT GCAGTTTAAG GTTTACACCT
361 ATAAAAGAGA GAGCCGTTAT CGTCTGTTTG TGGATGTACA GAGTGATATT ATTGACACGC
421 CCGGGCGACG GATAGTGATC CCCCTGGCCA GTGCACGTCT GCTGTCAGAT AAAGTCTCCC
481 GTGAACTTTA CCCGGTGGTG CATATCGGGG ATGAAAGCTG GCGCATGATG ACCACCGATA
541 TGGCCAGTGT GCCGGTCTCC GTTATCGGGG AAGAAGTGGC TGATCTCAGC CACCGCGAAA
601 ATGACATCAA AAACGCCATT AACCTGATGT TCTGGGGAAT ATAGAATTCG CGGCCGCACT
661 CGAGATATCT AGACCCAGCT TTCTTGTACA AAGTTGGCAT TATAAGAAAG CATTGCTTAT
721 CAATTTGTTG CAACGAACAG GTCACTATCA GTCAAAATAA AATCATTATT TGCCATCCAG
781 CTGCAGCTCT GGCCCGTGTC TCAAAATCTC TGATGTTACA TTGCACAAGA TAAAAATATA
841 TCATCATGAA CAATAAAACT GTCTGCTTAC ATAAACAGTA ATACAAGGGG TGTTATGAGC
901 CATATTCAAC GGGAAACGTC GAGGCCGCGA TTAAATTCCA ACATGGATGC TGATTTATAT
961 GGGTATAAAT GGGCTCGCGA TAATGTCGGG CAATCAGGTG CGACAATCTA TCGCTTGTAT
1021 GGGAAGCCCG ATGCGCCAGA GTTGTTTCTG AAACATGGCA AAGGTAGCGT TGCCAATGAT
1081 GTTACAGATG AGATGGTCAG ACTAAACTGG CTGACGGAAT TTATGCCTCT TCCGACCATC
1141 AAGCATTTTA TCCGTACTCC TGATGATGCA TGGTTACTCA CCACTGCGAT CCCCGGAAAA
1201 ACAGCATTCC AGGTATTAGA AGAATATCCT GATTCAGGTG AAAATATTGT TGATGCGCTG
1261 GCAGTGTCCC TGCGCCGGTT GCATTCGATT CCTGTTTGTA ATTGTCCTTT TAACAGCGAT
1321 CGCGTATTTC GTCTCGCTCA GGCGCAATCA CGAATGAATA ACGGTTTGGT TGATGCGAGT
1381 GATTTTGATG ACGAGCGTAA TGGCTGGCCT GTTGAACAAG TCTGGAAAGA AATGCATAAA
1441 CTTTTGCCAT TCTCACCGGA TTCAGTCGTC ACTCATGGTG ATTTCTCACT TGATAACCTT
1501 ATTTTTGACG AGGGGAAATT AATAGGTTGT ATTGATGTTG GACGAGTCGG AATCGCAGAC
1561 CGATACCAGG ATCTTGCCAT CCTATGGAAC TGCCTCGGTG AGTTTTCTCC TTCATTACAG
1621 AAACGGCTTT TTCAAAAATA TGGTATTGAT AATCCTGATA TGAATAAATT GCAGTTTCAT
1681 TTGATGCTCG ATGAGTTTTT CTAATCAGAA TTGGTTAATT GGTTGTAACA TTATTCAGAT
1741 TGGGCCCCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC TTCTTGAGAT
1801 CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT ACCAGCGGTG
1861 GTTTGTTTGC CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG CTTCAGCAGA
1921 GCGCAGATAC CAAATACTGT TCTTCTAGTG TAGCCGTAGT TAGGCCACCA CTTCAAGAAC
1981 TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC TGCTGCCAGT
2041 GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA TAAGGCGCAG
2101 CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC GACCTACACC
2161 GAACTGAGAT ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG
2221 GCGGACAGGT ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTCCA
2281 GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG ACTTGAGCGT
2341 CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG CAACGCGGCC
2401 TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC TGCGTTATCC
2461 CCTGATTCTG TGGATAACCG TATTACCGCT AGCATGGATC TCGGGGACGT CTAACTACTA
2521 AGCGAGAGTA GGGAACTGCC AGGCATCAAA TAAAACGAAA GGCTCAGTCG GAAGACTGGG
2581 CCTTTCGTTT TATCTGTTGT TTGTCGGTGA ACGCTCTCCT GAGTAGGACA AATCCGCCGG
2641 GAGCGGATTT GAACGTTGTG AAGCAACGGC CCGGAGGGTG GCGGGCAGGA CGCCCGCCAT
2701 AAACTGCCAG GCATCAAACT AAGCAGAAGG CCATC



Figure: Cloning sites of the Entry Vector pENTR10

ttg tac aaa aaa gca ggc ttc gaa cta agg aaa tac tta cat atg gga acc aat tca gtc gac tgg atc cgg tac cga att cgc --
aac atg ttt ttt cgt ccg aag ctt gat tcc ttt atg aat gta tac cct tgg tta agt cag ctg acc tag gcc atg gct taa gcg --

-- Death (ccdB) --

aga att cgc ggc cgc act cga gat atc tag acc cag
tct taa gcg ccg gcg tga gct cta tag atc tgg gtc



pENTR10 2738 bp

Location (Base Nos.)    Gene Encoded
67..166                 attL1
342..647                ccdB
676..775                attL2
898..1707               KmR
1812..2385              ori



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTCGA ACTAAGGAAA
181 TACTTACATA TGGGAACCAA TTCAGTCGAC TGGATCCGGT ACCGAATTCG CTTACTAAAA
241 GCCAGATAAC AGTATGCGTA TTTGCGCGCT GATTTTTGCG GTATAAGAAT ATATACTGAT
301 ATGTATACCC GAAGTATGTC AAAAAGAGGT GTGCTTCTAG AATGCAGTTT AAGGTTTACA
361 CCTATAAAAG AGAGAGCCGT TATCGTCTGT TTGTGGATGT ACAGAGTGAT ATTATTGACA
421 CGCCCGGGCG ACGGATGGTG ATCCCCCTGG CCAGTGCACG TCTGCTGTCA GATAAAGTCT
481 CCCGTGAACT TTACCCGGTG GTGCATATCG GGGATGAAAG CTGGCGCATG ATGACCACCG
541 ATATGGCCAG TGTGCCGGTC TCCGTTATCG GGGAAGAAGT GGCTGATCTC AGCCACCGCG
601 AAAATGACAT CAAAAACGCC ATTAACCTGA TGTTCTGGGG AATATAGAAT TCGCGGCCGC
661 ACTCGAGATA TCTAGACCCA GCTTTCTTGT ACAAAGTTGG CATTATAAGA AAGCATTGCT
721 TATCAATTTG TTGCAACGAA CAGGTCACTA TCAGTCAAAA TAAAATCATT ATTTGCCATC
781 CAGCTGCAGC TCTGGCCCGT GTCTCAAAAT CTCTGATGTT ACATTGCACA AGATAAAAAT
841 ATATCATCAT GAACAATAAA ACTGTCTGCT TACATAAACA GTAATACAAG GGGTGTTATG
901 AGCCATATTC AACGGGAAAC GTCGAGGCCG CGATTAAATT CCAACATGGA TGCTGATTTA
961 TATGGGTATA AATGGGCTCG CGATAATGTC GGGCAATCAG GTGCGACAAT CTATCGCTTG
1021 TATGGGAAGC CCGATGCGCC AGAGTTGTTT CTGAAACATG GCAAAGGTAG CGTTGCCAAT
1081 GATGTTACAG ATGAGATGGT CAGACTAAAC TGGCTGACGG AATTTATGCC TCTTCCGACC
1141 ATCAAGCATT TTATCCGTAC TCCTGATGAT GCATGGTTAC TCACCACTGC GATCCCCGGA
1201 AAAACAGCAT TCCAGGTATT AGAAGAATAT CCTGATTCAG GTGAAAATAT TGTTGATGCG
1261 CTGGCAGTGT TCCTGCGCCG GTTGCATTCG ATTCCTGTTT GTAATTGTCC TTTTAACAGC
1321 GATCGCGTAT TTCGTCTCGC TCAGGCGCAA TCACGAATGA ATAACGGTTT GGTTGATGCG
1381 AGTGATTTTG ATGACGAGCG TAATGGCTGG CCTGTTGAAC AAGTCTGGAA AGAAATGCAT
1441 AAACTTTTGC CATTCTCACC GGATTCAGTC GTCACTCATG GTGATTTCTC ACTTGATAAC
1501 CTTATTTTTG ACGAGGGGAA ATTAATAGGT TGTATTGATG TTGGACGAGT CGGAATCGCA
1561 GACCGATACC AGGATCTTGC CATCCTATGG AACTGCCTCG GTGAGTTTTC TCCTTCATTA
1621 CAGAAACGGC TTTTTCAAAA ATATGGTATT GATAATCCTG ATATGAATAA ATTGCAGTTT
1681 CATTTGATGC TCGATGAGTT TTTCTAATCA GAATTGGTTA ATTGGTTGTA ACATTATTCA
1741 GATTGGGCCC CGTTCCACTG AGCGTCAGAC CCCGTAGAAA AGATCAAAGG ATCTTCTTGA
1801 GATCCTTTTT TTCTGCGCGT AATCTGCTGC TTGCAAACAA AAAAACCACC GCTACCAGCG
1861 GTGGTTTGTT TGCCGGATCA AGAGCTACCA ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC
1921 AGAGCGCAGA TACCAAATAC TGTTCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG
1981 AACTCTGTAG CACCGCCTAC ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC
2041 AGTGGCGATA AGTCGTGTCT TACCGGGTTG GACTCAAGAC GATAGTTACC GGATAAGGCG
2101 CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA GCTTGGAGCG AACGACCTAC
2161 ACCGAACTGA GATACCTACA GCGTGAGCTA TGAGAAAGCG CCACGCTTCC CGAAGGGAGA
2221 AAGGCGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG GAGAGCGCAC GAGGGAGCTT
2281 CCAGGGGGAA ACGCCTGGTA TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG
2341 CGTCGATTTT TGTGATGCTC GTCAGGGGGG CGGAGCCTAT GGAAAAACGC CAGCAACGCG
2401 GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TCCTGCGTTA
2461 TCCCCTGATT CTGTGGATAA CCGTATTACC GCTAGCATGG ATCTCGGGGA CGTCTAACTA
2521 CTAAGCGAGA GTAGGGAACT GCCAGGCATC GAATAAAACG AAAGGCTCAG TCGGAAGACT
2581 GGGCCTTTCG TTTTATCTGT TGTTTGTCGG TGAACGCTCT CCTGAGTAGG ACAAATCCGC
2641 CGGGAGCGGA TTTGAACGTT GTGAAGCAAC GGCCCGGAGG GTGGCGGGCA GGACGCCCGC
2701 CATAAACTGC CAGGCATCAA ACTAAGCAGA AGGCCATC
Figure [number illegible]: Cloning Sites of the Entry Vector pENTR11

Features and restriction sites marked above the sequence: Int, attL1, S.D., Kozak, XmnI, S.D.

T TTG TAC AAA AAA GCA GGC TTC GAA GGA GAT AGA ACC AAT TCT CTA AGG AAA TAC
  AAC ATG TTT TTT CGT CCG AAG CTT CCT CTA TCT TGG TTA AGA GAT TCC TTT ATG
  Leu Tyr Lys Lys Ala Gly Phe Glu Gly Asp Arg Thr Asn Ser Leu Arg Lys Tyr

Features and restriction sites marked above the sequence: Kozak, NcoI, SalI, BamHI, KpnI, EcoRI

TTA ACC ATG GTC GAC TGG ATC CGG TAC CGA ATT C-- ccdB
AAT TGG TAC CAG CTG ACC TAG GCC ATG GCT TAA G
Leu Thr Met Val Asp Trp Ile Arg Tyr Arg Ile

Features and restriction sites marked above the sequence: EcoRI, NotI, XhoI, EcoRV, XbaI, Int, attL2

ccdB --G AAT TCG CGG CCG CAC TCG AGA TAT CTA GAC CCA GCT TTC TTG TAC AAA G
       C TTA AGC GCC GGC GTG AGC TCT ATA GAT CTG GGT CGA AAG AAC ATG TTT C
         Asn Ser Arg Pro His Ser Arg Tyr Leu Asp Pro Ala Phe Leu Tyr Lys
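
The amino-acid line printed beneath each cloning-site sequence in the figure above can be checked by translating the top strand codon by codon. The following is a minimal sketch in Python, assuming the standard genetic code and the reading frame implied by the printed translation; the helper names (`CODON_TABLE`, `translate`) are illustrative and are not part of the source.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) into one-letter amino acids.

    Translation starts at the first base; a trailing partial codon is ignored.
    """
    cleaned = "".join(dna.split()).upper()
    usable = len(cleaned) - len(cleaned) % 3
    return "".join(CODON_TABLE[cleaned[i:i + 3]] for i in range(0, usable, 3))


# Top strand read in frame from the attL1 side of the figure.
ATTL1_SIDE = ("TTG TAC AAA AAA GCA GGC TTC GAA GGA GAT AGA ACC "
              "AAT TCT CTA AGG AAA TAC")

if __name__ == "__main__":
    # Leu Tyr Lys Lys Ala Gly Phe Glu Gly Asp Arg Thr Asn Ser Leu Arg Lys Tyr
    print(translate(ATTL1_SIDE))  # LYKKAGFEGDRTNSLRKY
```

The one-letter output corresponds to the three-letter translation printed in the figure, which is a quick way to confirm that a repaired scan of the sequence is still in frame.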



pENTR11 2744 bp (rotated to position 2578)

Location (Base Nos.)    Gene Encoded
67..166                 attL1
348..653                ccdB
683..781                attL2
904..1713               KmR
1818..2391              ori

1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTCGA AGGAGATAGA
181 ACCAATTCTC TAAGGAAATA CTTAACCATG GTCGACTGGA TCCGGTACCG AATTCGCTTA
241 CTAAAAGCCA GATAACAGTA TGCGTATTTG CGCGCTGATT TTTGCGGTAT AAGAATATAT
301 ACTGATATGT ATACCCGAAG TATGTCAAAA AGAGGTGTGC TTCTAGAATG CAGTTTAAGG
361 TTTACACCTA TAAAAGAGAG AGCCGTTATC GTCTGTTTGT GGATGTACAG AGTGATATTA
421 TTGACACGCC CGGGCGACGG ATAGTGATCC CCCTGGCCAG TGCACGTCTG CTGTCAGATA
481 AAGTCTCCCG TGAACTTTAC CCGGTGGTGC ATATCGGGGA TGAAAGCTGG CGCATGATGA
541 CCACCGATAT GGCCAGTGTG CCGGTCTCCG TTATCGGGGA AGAAGTGGCT GATCTCAGCC
601 ACCGCGAAAA TGACATCAAA AACGCCATTA ACCTGATGTT CTGGGGAATA TAGAATTCGC
661 GGCCGCACTC GAGATATCTA GACCCAGCTT TCTTGTACAA AGTTGGCATT ATAAGAAAGC
721 ATTGCTTATC AATTTGTTGC AACGAACAGG TCACTATCAG TCAAAATAAA ATCATTATTT
781 GCCATCCAGC TGCAGCTCTG GCCCGTGTCT CAAAATCTCT GATGTTACAT TGCACAAGAT
841 AAAAATATAT CATCATGAAC AATAAAACTG TCTGCTTACA TAAACAGTAA TACAAGGGGT
901 GTTATGAGCC ATATTCAACG GGAAACGTCG AGGCCGCGAT TAAATTCCAA CATGGATGCT
961 GATTTATATG GGTATAAATG GGCTCGCGAT AATGTCGGGC AATCAGGTGC GACAATCTAT
1021 CGCTTGTATG GGAAGCCCGA TGCGCCAGAG TTGTTTCTGA AACATGGCAA AGGTAGCGTT
1081 GCCAATGATG TTACAGATGA GATGGTCAGA CTAAACTGGC TGACGGAATT TATGCCTCTT
1141 CCGACCATCA AGCATTTTAT CCGTACTCCT GATGATGCAT GGTTACTCAC CACTGCGATC
1201 CCCGGAAAAA CAGCATTCCA GGTATTAGAA GAATATCCTG ATTCAGGTGA AAATATTGTT
1261 GATGCGCTGG CAGTGTTCCT GCGCCGGTTG CATTCGATTC CTGTTTGTAA TTGTCCTTTT
1321 AACAGCGATC GCGTATTTCG TCTCGCTCAG GCGCAATCAC GAATGAATAA CGGTTTGGTT
1381 GATGCGAGTG ATTTTGATGA CGAGCGTAAT GGCTGGCCTG TTGAACAAGT CTGGAAAGAA
1441 ATGCATAAAC TTTTGCCATT CTCACCGGAT TCAGTCGTCA CTCATGGTGA TTTCTCACTT
1501 GATAACCTTA TTTTTGACGA GGGGAAATTA ATAGGTTGTA TTGATGTTGG ACGAGTCGGA
1561 ATCGCAGACC GATACCAGGA TCTTGCCATC CTATGGAACT GCCTCGGTGA GTTTTCTCCT
1621 TCATTACAGA AACGGCTTTT TCAAAAATAT GGTATTGATA ATCCTGATAT GAATAAATTG
1681 CAGTTTCATT TGATGCTCGA TGAGTTTTTC TAATCAGAAT TGGTTAATTG GTTGTAACAT
1741 TATTCAGATT GGGCCCCGTT CCACTGAGCG TCAGACCCCG TAGAAAAGAT CAAAGGATCT
1801 TCTTGAGATC CTTTTTTTCT GCGCGTAATC TGCTGCTTGC AAACAAAAAA ACCACCGCTA
1861 CCAGCGGTGG TTTGTTTGCC GGATCAAGAG CTACCAACTC TTTTTCCGAA GGTAACTGGC
1921 TTCAGCAGAG CGCAGATACC AAATACTGTT CTTCTAGTGT AGCCGTAGTT AGGCCACCAC
1981 TTCAAGAACT CTGTAGCACC GCCTACATAC CTCGCTCTGC TAATCCTGTT ACCAGTGGCT
2041 GCTGCCAGTG GCGATAAGTC GTGTCTTACC GGGTTGGACT CAAGACGATA GTTACCGGAT
2101 AAGGCGCAGC GGTCGGGCTG AACGGGGGGT TCGTGCACAC AGCCCAGCTT GGAGCGAACG
2161 ACCTACACCG AACTGAGATA CCTACAGCGT GAGCTATGAG AAAGCGCCAC GCTTCCCGAA
2221 GGGAGAAAGG CGGACAGGTA TCCGGTAAGC GGCAGGGTCG GAACAGGAGA GCGCACGAGG
2281 GAGCTTCCAG GGGGAAACGC CTGGTATCTT TATAGTCCTG TCGGGTTTCG CCACCTCTGA
2341 CTTGAGCGTC GATTTTTGTG ATGCTCGTCA GGGGGGCGGA GCCTATGGAA AAACGCCAGC
2401 AACGCGGCCT TTTTACGGTT CCTGGCCTTT TGCTGGCCTT TTGCTCACAT GTTCTTTCCT
2461 GCGTTATCCC CTGATTCTGT GGATAACCGT ATTACCGCTA GCATGGATCT CGGGGACGTC
2521 TAACTACTAA GCGAGAGTAG GGAACTGCCA GGCATCAAAT AAAACGAAAG GCTCAGTCGG
2581 AAGACTGGGC CTTTCGTTTT ATCTGTTGTT TGTCGGTGAA CGCTCTCCTG AGTAGGACAA
2641 ATCCGCCGGG AGCGGATTTG AACGTTGTGA AGCAACGGCC CGGAGGGTGG CGGGCAGGAC
2701 GCCCGCCATA AACTGCCAGG CATCAAACTA AGCAGAAGGC CATC
Figure [number illegible]: pDEST1, Native Protein Expression in E. coli

[Annotated double-stranded sequence of the cloning region; not legible in the source.]

pDEST1 6464 bp

Location (Base Nos.)    Gene Encoded
216..257                Trc promoter
397..273                attR1
647..1306               CmR
1426..1510              inactivated ccdA
1648..1953              ccdB
1994..2118              attR2
2598..3503              ampR
4104..4264              ori
4504..4941              f1 ori (f1 intergenic region)
5340..6420              lacIq



1 GTTTGACAGC TTATCATCGA CTGCACGGTG CACCAATGCT TCTGGCGTCA GGCAGCCATC
61 GGAAGCTGTG GTATGGCTGT GCAGGTCGTA AATCACTGCA TAATTCGTGT CGCTCAAGGC
121 GCACTCCCGT TCTGGATAAT GTTTTTTGCG CCGACATCAT AACGGTTCTG GCAAATATTC
181 TGAAATGAGC TGTTGACAAT TAATCATCCG GTCCGTATAA TCTGTGGAAT TGTGAGCGGG
241 ATAACAATTT CATCGCGAGG TACCAAGCTA TCACAAGTTT GTACAAAAAA GCTGAACGAG
301 AAACGTAAAA TGATATAAAT ATCAATATAT TAAATTAGAT TTTGCATAAA AAACAGACTA
361 CATAATACTG TAAAACACAA CATATCCAGT CACTATGGCG GCCGCTAAGT TGGCAGCATC
421 ACCCGACGCA CTTTGCGCCG AATAAATACC TGTGACGGAA GATCACTTCG CAGAATAAAT
481 AAATCCTGGT GTCCCTGTTG ATACCGGGAA GCCCTGGGCC AACTTTTGGC GAAAATGAGA
541 CGTTGATCGG CACGTAAGAG GTTCCAACTT TCACCATAAT GAAATAAGAT CACTACCGGG
601 CGTATTTTTT GAGTTATCGA GATTTTCAGG AGCTAAGGAA GCTAAAATGG AGAAAAAAAT
661 CACTGGATAT ACCACCGTTG ATATATCCCA ATGGCATCGT AAAGAACATT TTGAGGCATT
721 TCAGTCAGTT GCTCAATGTA CCTATAACCA GACCGTTCAG CTGGATATTA CGGCCTTTTT
781 AAAGACCGTA AAGAAAAATA AGCACAAGTT TTATCCGGCC TTTATTCACA TTCTTGCCCG
841 CCTGATGAAT GCTCATCCGG AATTCCGTAT GGCAATGAAA GACGGTGAGC TGGTGATATG
901 GGATAGTGTT CACCCTTGTT ACACCGTTTT CCATGAGCAA ACTGAAACGT TTTCATCGCT
961 CTGGAGTGAA TACCACGACG ATTTCCGGCA GTTTCTACAC ATATATTCGC AAGATGTGGC
1021 GTGTTACGGT GAAAACCTGG CCTATTTCCC TAAAGGGTTT ATTGAGAATA TGTTTTTCGT
1081 CTCAGCCAAT CCCTGGGTGA GTTTCACCAG TTTTGATTTA AACGTGGCCA ATATGGACAA
1141 CTTCTTCGCC CCCGTTTTCA CCATGGGCAA ATATTATACG CAAGGCGACA AGGTGCTGAT
1201 GCCGCTGGCG ATTCAGGTTC ATCATGCCGT CTGTGATGGC TTCCATGTCG GCAGAATGCT
1261 TAATGAATTA CAACAGTACT GCGATGAGTG GCAGGGCGGG GCGTAAACGC GTGGATCCGG
1321 CTTACTAAAA GCCAGATAAC AGTATGCGTA TTTGCGCGCT GATTTTTGCG GTATAAGAAT
1381 ATATACTGAT ATGTATACCC GAAGTATGTC AAAAAGAGGT GTGCTATGAA GCAGCGTATT
1441 ACAGTGACAG TTGACAGCGA CAGCTATCAG TTGCTCAAGG CATATATGAT GTCAATATCT
1501 CCGGTCTGGT AAGCACAACC ATGCAGAATG AAGCCCGTCG TCTGCGTGCC GAACGCTGGA
1561 AAGCGGAAAA TCAGGAAGGG ATGGCTGAGG TCGCCCGGTT TATTGAAATG AACGGCTCTT
1621 TTGCTGACGA GAACAGGGAC TGGTGAAATG CAGTTTAAGG TTTACACCTA TAAAAGAGAG
1681 AGCCGTTATC GTCTGTTTGT GGATGTACAG AGTGATATTA TTGACACGCC CGGGCGACGG
1741 ATGGTGATCC CCCTGGCCAG TGCACGTCTG CTGTCAGATA AAGTCTCCCG TGAACTTTAC
1801 CCGGTGGTGC ATATCGGGGA TGAAAGCTGG CGCATGATGA CCACCGATAT GGCCAGTGTG
1861 CCGGTCTCCG TTATCGGGGA AGAAGTGGCT GATCTCAGCC ACCGCGAAAA TGACATCAAA
1921 AACGCCATTA ACCTGATGTT CTGGGGAATA TAAATGTCAG GCTCCCTTAT ACACAGCCAG
1981 TCTGCAGGTC GACCATAGTG ACTGGATATG TTGTGTTTTA CAGTATTATG TAGTCTGTTT
2041 TTTATGCAAA ATCTAATTTA ATATATTGAT ATTTATATCA TTTTACGTTT CTCGTTCAGC
2101 TTTCTTGTAC AAAGTGGTGA TAGCTTGGCT GTTTTGGCGG ATGAGAGAAG ATTTTCAGCC
2161 TGATACAGAT TAAATCAGAA CGCAGAAGCG GTCTGATAAA ACAGAATTTG CCTGGCGGCA
2221 GTAGCGCGGT GGTCCCACCT GACCCCATGC CGAACTCAGA AGTGAAACGC CGTAGCGCCG
2281 ATGGTAGTGT GGGGTCTCCC CATGCGAGAG TAGGGAACTG CCAGGCATCA AATAAAACGA
2341 AAGGCTCAGT CGAAAGACTG GGCCTTTCGT TTTATCTGTT GTTTGTCGGT GAACGCTCTC
2401 CTGAGTAGGA CAAATCCGCC GGGAGCGGAT TTGAACGTTG CGAAGCAACG GCCCGGAGGG
2461 TGGCGGGCAG GACGCCCGCC ATAAACTGCC AGGCATCAAA TTAAGCAGAA GGCCATCCTG
2521 ACGGATGGCC TTTTTGCGTT TCTACAAACT CTTTTTGTTT ATTTTTCTAA ATACATTCAA
2581 ATATGTATCC GCTCATGAGA CAATAACCCT GATAAATGCT TCAATAATAT TGAAAAAGGA
2641 AGAGTATGAG TATTCAACAT TTCCGTGTCG CCCTTATTCC CTTTTTTGCG GCATTTTGCC 
2701 TTCCTGTTTT TGCTCACCCA GAAACGCTGG TG AAAG T AAA AGATGCTGAA GATCAGTTGG 
2 761 GTGCACGAGT GGGTTACATC GAACTGGATC TCAACAGCGG TAAGATCCTT GAGAGTTTTC 
2821 GCCCCGAAGA ACGTTTTCCA ATGATG AG C A CTTTTAAAGT TCTGCTATGT GGCGC GGTAT 
2881 TATCCCGTGT TGACGCCGGG CAAGAGCAAC TCGGTCGCCG CATACACTAT TCTCAGAATG 

2 941 ACTTGGTTGA GTACTCACCA GT C AC AG AAA AGCATCTTAC GGATGGCATG ACAGTAAGAG 

3 001 AATTATG C AG TGCTGCCATA AC CATGAGTG ATAACACTGC GGCCAACTTA CTTCTGACAA 
3 061 CGATCGGAGG ACCGAAGGAG CTAAC CGCTT TTTTGCACAA CATGGGGGAT CATGTAACTC 
3121 GCCTTGATCG TTGGGAACCG GAGCTGAATG AAG C CAT AC C AAACGACGAG CGTG AC AC CA 
3181 CGATGCCTAC AGCAATGGCA AC AACGTTG C GCAAACTATT AACTGGCGAA CTACTTACTC 
3241 TAGCTTCCCG GCAACAATTA ATAGACTGGA TGGAGG CGGA TAAAGTTGCA GGACCACTTC 
3301 TGCGCTCGGC CCTTCCGGCT GGCTGGTTTA TTGCTGATAA ATCTGGAGCC GGTGAGCGTG 
3361 GGTCTCGCGG TATCATTGCA GCACTGGGGC CAGATGGTAA GCCCTCCCGT ATCGTAGTTA 
3421 TCTACACGAC GGGGAGTCAG GCAACTATGG ATG AAC G AAA TAGACAGATC GCTGAGATAG 
3481 GTGCCTCACT GATTAAGCAT TGGTAACTGT CAGACCAAGT TT AC T CAT AT ATACTTTAGA 
3 541 TTGATTTAAA ACTT CATTTT TAAT TTAAAA GGATCTAGGT G AAG AT C C TT TTTGATAATC 
36 01 T C ATG AC C AA AATCCCTTAA CGTGAGTTTT CGTTCCACTG AGCGTCAGAC CCCGTAGAAA 
3661 AGATCAAAGG ATCTTCTTGA GATCCTTTTT TTCTGCGCGT AATCTGCTGC TTGCAAACAA 
3 721 AAAAAC C AC C GCTACCAGCG GTGGTTTGTT TGCCGGATCA AGAGCTACCA ACTCTTTTTC 
3781 CGAAGGTAAC TGGCTTCAGC AGAGCG CAGA TACCAAATAC TGTCCTTCTA GTGTAGCCGT 
3 841 AGTTAGGCCA CCACTTCAAG AACTCTGTAG CACCGCCTAC ATACCTCGCT CTGCTAATCC 
3 901 TGTTACCAGT GGCTGCTGCC AG TG G C GAT A AGTCGTGTCT TACCGGGTTG G A C T C AAG AC 

3 961 GATAGTTACC GGATAAGGCG CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA 

4 021 GCTTGGAGCG AACGACCTAC AC C G AAC TG A GATACCTACA GCGTGAGCTA TGAGAAAGCG 
4 081 CCACGCTTCC CGAAGGGAGA AAGGCGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG 
4141 GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA TCTTTATAGT CCTGTCGGGT 
42 01 TTCGCCACCT CTGACTTGAG CGTCGATTTT TGTGATGCTC GTCAGGGGGG CGGAGCCTAT 
4 261 GGAAAAACGC CAGCAACGCG GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC 
4 321 ACATGTTCTT TCCTGCGTTA TCCCCTGATT CTGTGGATAA CCGTATTACC GCCTTTGAGT 
4 381 GAGCTGATAC CGCTCGCCGC AGCCGAACGA CCGAGCGCAG CGAGTCAGTG AGCGAGGAAG 
4 441 CGGAAGAGCG CCTGATGCGG TATTTTCTCC TTACGCATCT GTGCGGTATT TCACACCGCA 
4 501 TAATTTTGTT AAAATTCGCG TTAAATTTTT GTTAAATCAG CTCATTTTTT AACCAATAGG 
4 561 CCGAAATCGG CAAAATCCCT TATAAATCAA AAGAATAGAC CGAGATAGGG TTGAGTGTTG 
4 621 TTCCAGTTTG G AAC AAG AG T C C AC TATTAA AGAACGTGGA CTCCAACGTC AAAGGG CGAA 
4 681 AAACCGTCTA TCAGGGCGAT GGCCCACTAC GTGAACCATC AC C CTAAT C A AGTTTTTTGG 
4 741 GGTCGAGGTG CCGTAAAGCA CTAAATCGGA ACCCTAAAGG GAGCCCCCGA TTTAGAGCTT 
48 01 GACGGGGAAA GCCGGCGAAC GTGGCGAGAA AG G AAG G G AA GAAAGCGAAA GGAGCGGGCG 
4 861 CTAGGGCGCT GGCAAGTGTA GCGGTCACGC TGCGCGTAAC CACCACACCC GCCGCGCTTA 
4 921 ATGCGCCGCT ACAGGGCGCG TCCATTCGCC ATTCAGGCTG CTATGGTGCA CTCTCAGTAC 

4 981 AATCTGCTCT GATGCCGCAT AGTTAAGCCA GTACCAGTCA CGTAGCGATA TCGGAGTGTA 

5 041 TACACTCCGC TATCGCTACG TGACTGGGTC ATGGCTGCGC CCCGACACCC GCCAACACCC 
5101 GCTGACGCGC CCTGACGGGC TTGTCTGCTC CCGGCATCCG CTTACAGACA AGCTGTGACC 
5161 GTCTCCGGGA GCTGCATGTG TCAGAGGTTT TCACCGTCAT CACCGAAACG CGCGAGGCAG 
5221 CAGATCAATT CGCGCGCGAA GGCGAAGCGG CATG CATTTA CGTTGACACC ATCGAATGGT 
52 81 GCAAAACCTT TCGCGGTATG GCATGATAGC GCCCGGAAGA GAGTCAATTC AGGGTGGTGA 
5 341 ATGTGAAACC AGTAACGTTA TACGATGTCG CAGAGTATGC CGGTGTCTCT T AT C AG AC C G 
54 01 TTTCCCGCGT GGTGAACCAG GCCAGCCACG TTTCTGCGAA AACGCGGGAA AAAGTGGAAG 
5461 CGGCGATGGC GGAGCTGAAT TACATTCCCA ACCGCGTGGC ACAACAACTG GCGGGCAAAC
5521 AGTCGTTGCT GATTGGCGTT GCCACCTCCA GTCTGGCCCT GCACGCGCCG TCGCAAATTG 
5581 TCGCGGCGAT TAAATCTCGC GCCGATCAAC TGGGTGCCAG CGTGGTGGTG TCGATGGTAG 
5641 AACGAAGCGG CGTCGAAGCC TGTAAAGCGG CGGTGCACAA TCTTCTCGCG CAACGCGTCA 
5 701 GTGGGCTGAT CATTAACTAT CCGCTGGATG AC C AG G ATG C CATTGCTGTG GAAGCTGCCT 
5 761 GCACTAATGT TCCGGCGTTA TTTCTTGATG TCTCTGACCA GACACCCATC AACAGTATTA 
5 821 TTTTCTCCCA TGAAGACGGT ACGCGACTGG GCGTGGAGCA TCTGGTCGCA TTGGGTCACC 
58 81 AG CAAATC G C GCTGTTAGCG GGCCCATTAA GTTCTGTCTC GGCGCGTCTG CGTCTGGCTG 

5 941 GCTGGCATAA ATATCTCACT CGCAATCAAA TT C AG C C GAT AGCGGAACGG GAAGGCGACT 

6001 GGAGTGCCAT GTCCGGTTTT CAACAAACCA TGCAAATGCT GAATGAGGGC ATCGTTCCCA
6061 CTGCGATGCT GGTTGCCAAC GATCAGATGG CGCTGGGCGC AATGCGCGCC ATTACCGAGT
6121 CCGGGCTGCG CGTTGGTGCG GATATCTCGG TAGTGGGATA CGACGATACC GAAGACAGCT
6181 CATGTTATAT CCCGCCGTTA ACCACCATCA AACAGGATTT TCGCCTGCTG GGGCAAACCA
6241 GCGTGGACCG CTTGCTGCAA CTCTCTCAGG GCCAGGCGGT GAAGGGCAAT CAGCTGTTGC
6301 CCGTCTCACT GGTGAAAAGA AAAACCACCC TGGCACCCAA TACGCAAACC GCCTCTCCCC
6361 GCGCGTTGGC CGATTCATTA ATGCAGCTGG CACGACAGGT TTCCCGACTG GAAAGCGGGC
6421 AGTGAGCGCA ACGCAATTAA TGTGAGTTAG CGCGAATTGA TCTG
Figure [number illegible]: pDEST2, His6 fusions in E. coli

pDEST2 6553 bp

Location (Base Nos.)    Gene Encoded
912..962                Trc
1223..1009              attR1
1473..2132              CmR
2252..2336              inactivated ccdA
2474..2779              ccdB
2820..2944              attR2
3509..4414              ampR
5015..5175              ori
5415..5852              f1 ori (f1 intergenic region)
6225..752               lacIq



1 GGCGGTGCAC AATCTTCTCG CGCAACGCGT CAGTGGGCTG ATCATTAACT ATCCGCTGGA
61 TGACCAGGAT GCCATTGCTG TGGAAGCTGC CTGCACTAAT GTTCCGGCGT TATTTCTTGA
121 TGTCTCTGAC CAGACACCCA TCAACAGTAT TATTTTCTCC CATGAAGACG GTACGCGACT
181 GGGGGTGGAG CATCTGGTCG CATTGGGTCA CCAGCAAATC GCGCTGTTAG CGGGCCCATT
241 AAGTTCTGTC TCGGCGCGTC TGCGTCTGGC TGGCTGGCAT AAATATCTCA CTCGCAATCA
301 AATTCAGCCG ATAGCGGAAC GGGAAGGCGA CTGGAGTGCC ATGTCCGGTT TTCAACAAAC
361 CATGCAAATG CTGAATGAGG GCATCGTTCC CACTGCGATG CTGGTTGCCA ACGATCAGAT
421 GGCGCTGGGC GCAATGCGCG CCATTACCGA GTCCGGGCTG CGCGTTGGTG CGGATATCTC
481 GGTAGTGGGA TACGACGATA CCGAAGACAG CTCATGTTAT ATCCCGCCGT CAACCACCAT
541 CAAACAGGAT TTTCGCCTGC TGGGGCAAAC CAGCGTGGAC CGCTTGCTGC AACTCTCTCA
601 GGGCCAGGCG GTGAAGGGCA ATCAGCTGTT GCCCGTCTCA CTGGTGAAAA GAAAAACCAC
661 CCTGGCACCC AATACGCAAA CCGCCTCTCC CCGCGCGTTG GCCGATTCAT TAATGCAGCT
721 GGCACGACAG GTTTCCCGAC TGGAAAGCGG GCAGTGAGCG CAACGCAATT AATGTGAGTT
781 AGCGCGAATT GATCTGGTTT GACAGCTTAT CATCGACTGC ACGGTGCACC AATGCTTCTG
841 GCGTCAGGCA GCCATCGGAA GCTGTGGTAT GGCTGTGCAG GTCGTAAATC ACTGCATAAT
901 TCGTGTCGCT CAAGGCGCAC TCCCGTTCTG GATAATGTTT TTTGCGCCGA CATCATAACG
961 GTTCTGGCAA ATATTCTGAA ATGAGCTGTT GACAATTAAT CATCCGGTCC GTATAATCTG
1021 TGGAATTGTG AGCGGATAAC AATTTCACAC AGGAAACAGA CCATGTCGTA CTACCATCAC
1081 CATCACCATC ACGGCATCAC AAGTTTGTAC AAAAAAGCTG AACGAGAAAC GTAAAATGAT
1141 ATAAATATCA ATATATTAAA TTAGATTTTG CATAAAAAAC AGACTACATA ATACTGTAAA
1201 ACACAACATA TCCAGTCACT ATGGCGGCCG CTAAGTTGGC AGCATCACCC GACGCACTTT
1261 GCGCCGAATA AATACCTGTG ACGGAAGATC ACTTCGCAGA ATAAATAAAT CCTGGTGTCC
1321 CTGTTGATAC CGGGAAGCCC TGGGCCAACT TTTGGCGAAA ATGAGACGTT GATCGGCACG
1381 TAAGAGGTTC CAACTTTCAC CATAATGAAA TAAGATCACT ACCGGGCGTA TTTTTTGAGT
1441 TATCGAGATT TTCAGGAGCT AAGGAAGCTA AAATGGAGAA AAAAATCACT GGATATACCA
1501 CCGTTGATAT ATCCCAATGG CATCGTAAAG AACATTTTGA GGCATTTCAG TCAGTTGCTC
1561 AATGTACCTA TAACCAGACC GTTCAGCTGG ATATTACGGC CTTTTTAAAG ACCGTAAAGA
1621 AAAATAAGCA CAAGTTTTAT CCGGCCTTTA TTCACATTCT TGCCCGCCTG ATGAATGCTC
1681 ATCCGGAATT CCGTATGGCA ATGAAAGACG GTGAGCTGGT GATATGGGAT AGTGTTCACC
1741 CTTGTTACAC CGTTTTCCAT GAGCAAACTG AAACGTTTTC ATCGCTCTGG AGTGAATACC
1801 ACGACGATTT CCGGCAGTTT CTACACATAT ATTCGCAAGA TGTGGCGTGT TACGGTGAAA
1861 ACCTGGCCTA TTTCCCTAAA GGGTTTATTG AGAATATGTT TTTCGTCTCA GCCAATCCCT
1921 GGGTGAGTTT CACCAGTTTT GATTTAAACG TGGCCAATAT GGACAACTTC TTCGCCCCCG
1981 TTTTCACCAT GGGCAAATAT TATACGCAAG GCGACAAGGT GCTGATGCCG CTGGCGATTC
2041 AGGTTCATCA TGCCGTCTGT GATGGCTTCC ATGTCGGCAG AATGCTTAAT GAATTACAAC
2101 AGTACTGCGA TGAGTGGCAG GGCGGGGCGT AAACGCGTGG ATCCGGCTTA CTAAAAGCCA
2161 GATAACAGTA TGCGTATTTG CGCGCTGATT TTTGCGGTAT AAGAATATAT ACTGATATGT
2221 ATACCCGAAG TATGTCAAAA AGAGGTGTGC TATGAAGCAG CGTATTACAG TGACAGTTGA
2281 CAGCGACAGC TATCAGTTGC TCAAGGCATA TATGATGTCA ATATCTCCGG TCTGGTAAGC
2341 ACAACCATGC AGAATGAAGC CCGTCGTCTG CGTGCCGAAC GCTGGAAAGC GGAAAATCAG
2401 GAAGGGATGG CTGAGGTCGC CCGGTTTATT GAAATGAACG GCTCTTTTGC TGACGAGAAC
2461 AGGGACTGGT GAAATGCAGT TTAAGGTTTA CACCTATAAA AGAGAGAGCC GTTATCGTCT
2521 GTTTGTGGAT GTACAGAGTG ATATTATTGA CACGCCCGGG CGACGGATGG TGATCCCCCT
2581 GGCCAGTGCA CGTCTGCTGT CAGATAAAGT CTCCCGTGAA CTTTACCCGG TGGTGCATAT
2 641 CGGGGATGAA AGCTGGCGCA TG AT G AC C AC CGATATGGCC AGTGTGCCGG TCTCCGTTAT 
2 7 01 CGGGGAAGAA GTGGCTGATC TCAGCCACCG C GAAAATGAC ATCAAAAACG C CATTAACCT 
2 761 GATGTTCTGG G G AAT AT AAA TGTCAGGCTC CCTTATACAC AGCCAGTCTG CAGGTCGACC 
2 821 ATAGTGACTG GATATGTTGT GTTTTACAGT ATTATGTAGT CTGTTTTTTA TGCAAAATCT 
2 881 AATTTAATAT ATTGATATTT ATATCATTTT ACGTTTCTCG TTCAGCTTTC TTGTACAAAG 

2 941 TGGTGATGCC CATATGGGAA TTCAAAGGCC TACGTCGACG AGCTCACTAG TCGCGGCCGC 

3 001 TTCTAGAGGA TCCCTCGAGG CATGCGGTAC CAAGCTTGGC TGTTTTGGCG GATGAGAGAA 
3 061 GATTTTCAGC CTGATACAGA TTAAATCAGA ACGCAGAAGC GGTCTGATAA AACAGAATTT 
3121 GCCTGGCGGC AGTAGCGCGG TGGTCCCACC TGACCCCATG CCGAACTCAG AAGTGAAACG 
3181 CCGTAGCGCC GATGGTAGTG TGGGGTCTCC CCATGCGAGA GTAGGGAACT GCCAGGCATC 
3241 AAATAAAACG AAAGGCTCAG TCGAAAGACT GGGCCTTTCG TTTTATCTGT TGTTTGTCGG 
3 3 01 TGAACGCTCT CCTGAGTAGG ACAAATCCGC CGGGAGCGGA TTTGAACGTT GCGAAGCAAC 
33 61 GGCCCGGAGG GTGGCGGGCA GGACGCCCGC CATAAACTGC C AG G CAT C AA ATTAAGCAGA 
3421 AGGCCATCCT GACGGATGGC CTTTTTGCGT TTCTACAAAC TCTTTTTGTT TATTTTTCTA 
3 481 AAT AC ATT C A AATATGTATC CGCTCATGAG AC AATAAC C C TGATAAATGC TT C AAT AAT A 
3 541 TTGAAAAAGG AAGAGTATGA GTATTCAACA TTTCCGTGTC GCCCTTATTC CCTTTTTTGC 
3 6 01 GGCATTTTGC CTTCCTGTTT TTGCTCACCC AGAAACGCTG GTGAAAGTAA AAG ATG CTG A 
3 661 AGATCAGTTG GGTGCACGAG TGGGTTACAT CGAACTGGAT CTCAACAGCG GTAAGATCCT 
3 721 TGAGAGTTTT CGCCCCGAAG AACGTTTTCC AATG ATG AG C ACTTTTAAAG TTCTGCTATG 
3 781 TGGCGCGGTA TTATCCCGTG TTGACGCCGG GCAAGAGCAA CTCGGTCGCC G C ATA C ACTA 
3 841 TTCTCAGAAT GACTTGGTTG AGTACTCACC AG T C AC AGAA AAG CAT C TT A CGGATGGCAT 
3 901 GACAGTAAGA G AATTATG C A GTGCTGCCAT AACCATGAGT GATAACACTG CGGCCAACTT 

3 961 ACTTCTGACA ACGATCGGAG GAC CGAAGGA GCTAACCGCT TTTTTGCACA ACATGGGGGA 

4 021 TCATGTAACT CGCCTTGATC GTTGGGAACC GGAGCTGAAT GAAGC CATAC CAAACGACGA 
4 081 GCGTGACACC ACGATGCCTA C AG C AATG G C AACAACGTTG CGCAAACTAT TAACTGG CGA 
4 141 ACTACTTACT CTAGCTTCCC GGCAACAATT AATAGACTGG ATGGAGGCGG ATAAAGTTGC 
42 01 AGGAC C AC TT CTGCGCTCGG CCCTTCCGGC TGGCTGGTTT ATTGCTGATA AATCTGGAGC 
42 61 CGGTGAGCGT GGGTCTCGCG GTATCATTGC AGCACTGGGG CCAGATGGTA AGCCCTCCCG 
4 321 TATCGTAGTT ATCTACACGA CGGGGAGTCA GGCAACTATG GATGAACGAA ATAGACAGAT 
4 381 CGCTGAGATA GGTGCCTCAC TGATTAAG C A TTGGTAACTG TCAGACCAAG TTTACTCATA 
44 41 TATACTTTAG ATTGATTTAA AACTTCATTT TTAATTTAAA AGGATCTAGG T G AAG AT C C T 
4 501 TTTTGATAAT CTCATGACCA AAATCCCTTA ACGTGAGTTT TCGTTCCACT GAGCGTCAGA 
4 561 CCCCGTAGAA AAGATCAAAG GATCTTCTTG AGATCCTTTT TTTCTGCGCG T AAT CTG CTG 
46 21 CTTGCAAACA AAAAAACCAC CGCTACCAGC GGTGGTTTGT TTGCCGGATC AAG AG C T AC C 
46 81 AACTCTTTTT CCGAAGGTAA CTGGCTTCAG CAGAGCGCAG ATACCAAATA CTGTCCTTCT 
4 741 AGTGTAGCCG TAGTTAGGCC ACCACTTCAA GAACTCTGTA GCACCGCCTA CATACCTCGC 
4 8 01 TCTGCTAATC CTGTTACCAG TGGCTGCTGC CAGTGGCGAT AAGTCGTGTC TTACCGGGTT 
4 861 GGACTCAAGA CGATAGTTAC CGGATAAGGC GCAGCGGTCG GGCTGAACGG GGGGTTCGTG 
4 921 CACACAGCCC AGCTTGGAGC GAACGACCTA C AC CGAACTG AG AT AC C T AC AGCGTGAGCT 

4 981 ATGAGAAAGC GCCACGCTTC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG TAAGCGGCAG 

5 041 GGTCGGAACA GGAGAGCGCA CGAGGGAGCT TCCAGGGGGA AACGCCTGGT AT C TTT AT AG 
5101 TCCTGTCGGG TTTCGCCACC TCTGACTTGA GCGTCGATTT TTGTGATGCT CGTCAGGGGG 
5161 GCGGAGCCTA TGGAAAAACG CCAGCAACGC GGCCTTTTTA CGGTTCCTGG CCTTTTGCTG 
5221 GCCTTTTGCT CACATGTTCT TTCCTGCGTT ATCCCCTGAT TCTGTGGATA AC CGTATTAC 

52 81 CGCCTTTGAG TGAGCTGATA CCGCTCGCCG CAGCCGAACG ACCGAGCGCA GCGAGTCAGT 

53 41 GAGCGAGGAA GCGGAAGAGC GCCTGATGCG GTATTTTCTC CTTACGCATC TGTGCGGTAT 

54 01 TTCACACCGC ATAATTTTGT TAAAATTCGC GTTAAATTTT TGTTAAATCA GCTCATTTTT 
5461 TAACCAATAG GCCGAAATCG GCAAAATCCC TTATAAATCA AAAGAATAGA CCGAGATAGG
5 521 GTTGAGTGTT GTTCCAGTTT GGAACAAGAG TCCACTATTA AAGAACGTGG ACTCCAACGT 
5 581 CAAAGGGCGA AAAACCGTCT ATCAGGGCGA TGGCCCACTA CGTGAACCAT CACCCTAATC 
5 641 AAGTTTTTTG GGGTCGAGGT GCCGTAAAGC ACTAAATCGG AACCCTAAAG GGAGCCCCCG 

57 01 ATTTAGAGCT TGACGGGGAA AGCCGGCGAA CGTGGCGAGA AAGGAAGGGA AGAAAGCGAA 

5 761 AGGAGCGGGC GCTAGGGCGC TGGCAAGTGT AGCGGTCACG CTGCGCGTAA CCACCACACC 
5821 CGCCGCGCTT AATGCGCCGC TACAGGGCGC G T C C C ATT C G CCATTCAGGC TGCTATGGTG 

58 81 CACTCTCAGT ACAATCTGCT CTGATGCCGC AT AG TTAAG C CAGTATACAC TCCGCTATCG 
5941 CTACGTGACT GGGTCATGGC TGCGCCCCGA CACCCGCCAA CACCCGCTGA CGCGCCCTGA 

6001 CGGGCTTGTC TGCTCCCGGC ATCCGCTTAC AGACAAGCTG TGACCGTCTC CGGGAGCTGC
6061 ATGTGTCAGA GGTTTTCACC GTCATCACCG AAACGCGCGA GGCAGCAGAT CAATTCGCGC
6121 GCGAAGGCGA AGCGGCATGC ATTTACGTTG ACACCATCGA ATGGTGCAAA ACCTTTCGCG
6181 GTATGGCATG ATAGCGCCCG GAAGAGAGTC AATTCAGGGT GGTGAATGTG AAACCAGTAA
6241 CGTTATACGA TGTCGCAGAG TATGCCGGTG TCTCTTATCA GACCGTTTCC CGCGTGGTGA
6301 ACCAGGCCAG CCACGTTTCT GCGAAAACGC GGGAAAAAGT GGAAGCGGCG ATGGCGGAGC
6361 TGAATTACAT TCCCAACCGC GTGGCACAAC AACTGGCGGG CAAACAGTCG TTGCTGATTG
6421 GCGTTGCCAC CTCCAGTCTG GCCCTGCACG CGCCGTCGCA AATTGTCGCG GCGATTAAAT
6481 CTCGCGCCGA TCAACTGGGT GCCAGCGTGG TGGTGTCGAT GGTAGAACGA AGCGGCGTCG
6541 AAGCCTGTAA AGC
Figure [number illegible]: pDEST3, GST fusions in E. coli

pDEST3 6823 bp

Location (Base Nos.)    Gene Encoded
150..200                Trc
1087..963               attR1
1337..1996              CmR
2116..2200              inactivated ccdA
2338..2643              ccdB
2684..2808              attR2
3231..4091              ampR
5295..6254              lacIq



1 ACGTTATCGA CTGCACGGTG CACCAATGCT TCTGGCGTCA GGCAGCCATC GGAAGCTGTG 
61 GTATGGCTGT GCAGGTCGTA AATCACTGCA TAATTCGTGT CGCTCAAGGC GCACTCCCGT 
121 TCTGGATAAT GTTTTTTGCG C C G AC AT CAT AACGGTTCTG GCAAATATTC TGAAATGAGC 
181 TGTTGACAAT TAATCATCGG CTCGTATAAT GTGTGGAATT GTGAGCGGAT AACAATTTCA 
241 CACAGGAAAC AGTATTCATG TCCCCTATAC TAGGTTATTG GAAAATTAAG GGCCTTGTGC 

3 01 AACCCACTCG ACTTCTTTTG GAATATCTTG AAGAAAAATA TG AAG AG CAT TTGTATGAGC 
361 GCGATGAAGG TG ATAAATG G CGAAACAAAA AG TTTGAATT GGGTTTGGAG TTTCCCAATC 
421 TTCCTTATTA TATTGATGGT GATG TTAAAT TAACACAGTC TATGGCCATC ATACGTTATA 

4 81 TAG CTGACAA GCACAACATG TTGGGTGGTT GTCCAAAAGA GCGTGCAGAG ATTTCAATGC 
541 TTGAAGGAGC GGTTTTGGAT ATTAGATACG GTGTTTCGAG AATTGCATAT AG T AAAG ACT 
6 01 TTGAAACTCT CAAAGTTGAT TTTCTTAGCA AGCTACCTGA AATGCTGAAA ATGTTCGAAG 
661 ATCGTTTATG TCATAAAACA TATTTAAATG GTGATCATGT AACCCATCCT GACTTCATGT 
721 TGTATGACGC TCTTGATGTT GTTTTATACA TGGACCCAAT GTGCCTGGAT GCGTTCCCAA 
781 AATTAGTTTG TTTTAAAAAA C GTATTG AAG CTATCCCACA AATTGATAAG TACTTGAAAT 
841 CCAGCAAGTA TATAG CATGG CCTTTGCAGG GCTGGCAAGC CACGTTTGGT GGTGGCGACC 
901 ATCCTCCAAA ATCGGATCTG GTTCCGCGTG GATCTCGTCG TGCATCTGTT GGATC C C CAT 
961 CAACAAGTTT GTACAAAAAA GCTGAACGAG AAACGTAAAA TGATATAAAT AT C AAT AT AT 

1021 TAAATTAGAT TTTGCATAAA AAACAGACTA CAT AAT AC TG TAAAACACAA CATATC C AGT 
1081 CACTATGGCG GCCGCTAAGT TGGCAGCATC ACCCGACGCA CTTTGCGCCG AATAAATACC 
1141 TGTGACGGAA GATCACTTCG CAGAATAAAT AAATCCTGGT GTCCCTGTTG ATACCGGGAA 
12 01 GCCCTGGGCC AACTTTTGGC GAAAATGAGA CGTTGATCGG C AC GTAAGAG GTTCCAACTT 

12 61 TCACCATAAT GAAATAAGAT CACTACCGGG CGTATTTTTT GAGTTATCGA GATTTTCAGG 

13 21 AGCTAAGGAA GCTAAAATGG AGAAAAAAAT CACTGGATAT ACCACCGTTG ATATATCCCA 
13 81 ATGGCATCGT AAAGAACATT TTGAGGCATT TCAGTCAGTT GCTCAATGTA CCTATAACCA 
1441 GACCGTTCAG CTGGATATTA CGGCCTTTTT AAAGACCGTA AAGAAAAATA AG C AC AAGTT 

15 01 TTATCCGGCC TTTATTCACA TTCTTGCCCG CCTGATGAAT GCTCATCCGG AATTC CGTAT 
1561 GGCAATGAAA GACGGTGAGC TGGTGATATG GGATAGTGTT CACCCTTGTT ACACCGTTTT 
1621 CCATGAGCAA ACTGAAACGT TTTCATCGCT CTGGAGTGAA TACCACGACG ATTTCCGGCA 

16 81 GTTTCTACAC AT AT ATT C G C AAGATGTGGC GTGTTACGGT GAAAACCTGG CCTATTTCCC 
1741 TAAAGGGTTT ATTGAGAATA TGTTTTTCGT C T C AG C C AAT CCCTGGGTGA GTTTCACCAG 
18 01 TTTTGATTTA AACGTGGCCA ATATGGACAA CTTCTTCGCC CCCGTTTTCA CCATGGGCAA 
1861 ATATTATACG CAAGGCGACA AGGTGCTGAT GCCGCTGGCG ATTCAGGTTC ATCATGCCGT 

1921 CTGTGATGGC TTCCATGTCG GCAGAATGCT TAATGAATTA CAACAGTACT GCGATGAGTG
1981 GCAGGGCGGG GCGTAAAGAT CTGGATCCGG CTTACTAAAA GCCAGATAAC AGT AT G C G TA 
2 041 TTTGCGCGCT GATTTTTGCG GTATAAGAAT ATATACTGAT ATGTATACCC GAAGTATGTC 
2101 AAAAAG AG GT GTGCTATGAA GCAGCGTATT ACAGTGACAG TTG AC AG C G A C AG C TAT C AG 
2161 TTGCTCAAGG CATATATGAT GTCAATATCT CCGGTCTGGT AAGCACAACC ATGCAGAATG
2221 AAGCCCGTCG TCTGCGTGCC GAACGCTGGA AAGCGGAAAA TCAGGAAGGG ATGGCTGAGG 
2 281 TCGCCCGGTT TATTGAAATG AACGGCTCTT TTGCTGACGA GAACAGGGAC TGGTGAAATG 
2 341 CAGTTTAAGG TTTAC AC C TA TAAAAGAGAG AGCCGTTATC GTCTGTTTGT GGATGTACAG 
2 4 01 AGTGATATTA TTGACACGCC CGGGCGACGG ATGGTGATCC CCCTGGCCAG TGCACGTCTG 
2 461 CTGTCAGATA AAGTCTCCCG TGAACTTTAC CCGGTGGTGC ATATCGGGGA TGAAAGCTGG 
2 521 CGCATGATGA C C AC C GAT AT GGCCAGTGTG CCGGTCTCCG TTATCGGGGA AGAAGTGGCT 
2 581 GATCTCAGCC ACCGCGAAAA TGACATCAAA AACGCCATTA ACCTGATGTT CTGGGGAATA 
2641 TAAATGTCAG GCTCCCTTAT ACACAGCCAG TCTGCAGGTC GACCATAGTG ACTGGATATG
2701 TTGTGTTTTA CAGTATTATG TAGTCTGTTT TTTATGCAAA ATCTAATTTA ATATATTGAT
2761 ATTTATATCA TTTTACGTTT CTCGTTCAGC TTTCTTGTAC AAAGTGGTTG ATGGGAATTC 
2 821 ATCGTGACTG ACTGACGATC TGCCTCGCGC GTTTCGGTGA TGACGGTGAA AACCTCTGAC 
2 8 81 AC ATG CAGCT CCCGGAGACG GTCACAGCTT GTCTGTAAGC GGATGCCGGG AGCAGACAAG 

2 941 CCCGTCAGGG CGCGTCAGCG GGTGTTGGCG GGTGTCGGGG CGCAGCCATG ACCCAGTCAC 

3 001 G TAG C GAT AG CGGAGTGTAT AATTCTTGAA GACGAAAGGG CCTCGTGATA CGCCTATTTT 
3 061 TAT AG G TTAA TGTCATGATA ATAATGGTTT CTTAGACGTC AGGTGGCACT TTTCGGGGAA 
3121 ATGTGCGCGG AACCCCTATT TGTTTATTTT T C TAAATAC A TTCAAATATG TATCCGCTCA 
3181 TGAGACAATA ACCCTGATAA ATGCTTCAAT AATATTGAAA AAGGAAGAGT ATGAGTATTC 
32 41 AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT TTGCCTTCCT GTTTTTGCTC 
3 3 01 ACCCAGAAAC G CTGGTGAAA GTAAAAGATG CTGAAGATCA GTTGGGTGCA CGAGTGGGTT 
3361 ACATCGAACT GGATCTCAAC AGCGGTAAGA T C CTTG AG AG TTTTCGCCCC GAAGAACGTT 
3421 TTCCAATGAT GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG 
34 81 CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG GTTGAGTACT 
3 541 CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT AAGAGAATTA TGCAGTGCTG 
36 01 C C AT AAC CAT GAGTGATAAC ACTGCGGCCA ACTTACTTCT GACAACGATC GGAGGAC CGA 
3661 AGGAG CTAAC CGCTTTTTTG CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG 
3 721 AACCGGAGCT GAATGAAGCC ATAC CAAACG ACGAGCGTGA C AC CACGATG CCTGCAGCAA 
3 781 TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT TCCCGGCAAC 
3 841 AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC ACTTCTGCGC TCGGCCCTTC 
3 9 01 CGGCTGGCTG GTTTATTG CT GATAAATCTG GAGCCGGTGA GCGTGGGTCT CGCGGTATCA 

3 961 TTGCAGCACT GGGGCCAGAT GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA 

4 021 GTCAGGCAAC TATGGATGAA C G AAAT AG AC AGATCGCTGA GATAGGTGCC T C AC TG ATT A 
4 081 AG C ATTGGTA ACTGTCAGAC CAAGTTTACT CATATATACT TTAG AT TG AT TTAAAACTTC 
4141 ATTTTTAATT T AAAAGG AT C TAGGTGAAGA TC CTTTTTG A TAATCTCATG ACCAAAATCC 
42 01 CTTAACGTGA GTTTTCGTTC CACTGAG CGT CAGACCCCGT AG AAAAG AT C AAAGG AT CTT 

42 61 CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC 
4 321 CAGCGGTGGT TTGTTTGCCG GATC AAGAG C TACCAACTCT TTTTCCGAAG GTAACTGGCT 

43 81 TCAGCAGAGC GCAGATACCA AATACTGTCC TTCTAGTGTA GCCGTAGTTA GGCCACCACT 
4441 TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTG 

45 01 CTGCCAGTGG CGATAAGTCG TGTCTTACCG GGTTGGACTC AAG AC GAT AG TTACCGGATA 
4 561 AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA GCCCAGCTTG GAGCGAACGA 
4 621 CCTACACCGA AC TG AG ATAC CTACAGCGTG AG CTATG AG A AAGCGCCACG CTTCCCGAAG 

46 81 GGAG AAAGG C GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG CGCACGAGGG 
4 741 AGCTTCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC CACCTCTGAC 
48 01 TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AACGCCAGCA 
4 861 ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG TTCTTTCCTG 
4 921 CGTTATCCCC TGATTCTGTG GATAACCGTA TTACCGCCTT TGAGTGAGCT GATACCGCTC 

4 981 GCCGCAGCCG AACGACCGAG CGCAGCGAGT CAGTGAGCGA GGAAGCGGAA GAGCGCCTGA 

5041 TGCGGTATTT TCTCCTTACG CATCTGTGCG GTATTTCACA CCGCATAAAT TCCGACACCA
5101 TCGAATGGTG CAAAACCTTT CGCGGTATGG CATGATAGCG C C C G G AAG AG AGTCAATTCA 
5161 GGGTGGTGAA TGTGAAACCA GTAACGTTAT ACGATGTCGC AG AG TATG C C GGTGTCTCTT 
52 21 AT C AG AC CGT TTCCCGCGTG GTGAACCAGG CCAGCCACGT TTCTGCGAAA ACGCGGGAAA 
52 81 AAGTGGAAG C GGCGATGGCG GAGCTGAATT ACATTCCCAA CCGCGTGGCA CAACAACTGG 
5 341 CGGGCAAACA GTCGTTGCTG ATTGGCGTTG CCACCTCCAG TCTGGCCCTG CACGCGCCGT 
54 01 CGCAAATTGT CGCGGCGATT AAATCTCGCG C C GAT C AAC T GGGTGCCAGC GTGGTGGTGT 

54 61 CGATGGTAGA ACGAAGCGGC GTCGAAGCCT GTAAAGCGGC GGTGCACAAT CTTCTCGCGC 
5 521 AACGCGTCAG TGGGCTGATC ATTAACTATC CGCTGGATGA CCAGGATGCC ATTGCTGTGG 

55 81- AAGCTGCCTG C AC TAATGTT CCGGCGTTAT TTCTTGATGT CTCTGACCAG ACACCCATCA 

56 41 ACAGTATTAT TTTCTCCCAT GAAGACGGTA CGCGACTGGG C G TG GAG CAT CTGGTCGCAT 

57 01 TGGGTCACCA GCAAATCGCG CTGTTAGCGG G C C C ATT AAG TTCTGTCTCG GCGCGTCTGC 

57 61 GTCTGGCTGG CTGGCATAAA TATCTCACTC G CAATC AAAT TCAGCCGATA GCGGAACGGG 
5821 AAGGCGACTG GAGTGC CATG TCCGGTTTTC AACAAACCAT GCAAATGCTG AATGAGGGCA 

58 81 TCGTTCCCAC TGCGATGCTG GTTGCCAACG ATCAGATGGC GCTGGGCGCA ATGCGCGCCA 

5 941 TTACCGAGTC CGGGCTGCGC GTTGGTGCGG ATATCTCGGT AGTGGGATAC GACGATACCG 

6 001 AAGACAGCTC ATGTTATATC CCGCCGTTAA CCACCATCAA ACAGGATTTT CGCCTGCTGG 
6 061 GGCAAACCAG CGTGGACCGC TTGCTGCAAC TCTCTCAGGG CCAGGCGGTG AAGGG CAATC 
6121 AGCTGTTGCC CGTCTCACTG GTGAAAAGAA AAACCACCCT GGCGCCCAAT ACGCAAACCG- 
6181 CCTCTCCCCG CGCGTTGGCC GATTCATTAA TGCAGCTGGC ACGACAGGTT TCCCGACTGG
6241 AAAGCGGGCA GTGAGCGCAA CGCAATTAAT GTGAGTTAGC TCACTCATTA GGCACCCCAG
6301 GCTTTACACT TTATGCTTCC GGCTCGTATG TTGTGTGGAA TTGTGAGCGG ATAACAATTT
6361 CACACAGGAA ACAGCTATGA CCATGATTAC GGATTCACTG GCCGTCGTTT TACAACGTCG
6421 TGACTGGGAA AACCCTGGCG TTACCCAACT TAATCGCCTT GCAGCACATC CCCCTTTCGC
6481 CAGCTGGCGT AATAGCGAAG AGGCCCGCAC CGATCGCCCT TCCCAACAGT TGCGCAGCCT
6541 GAATGGCGAA TGGCGCTTTG CCTGGTTTCC GGCACCAGAA GCGGTGCCGG AAAGCTGGCT
6601 GGAGTGCGAT CTTCCTGAGG CCGATACTGT CGTCGTCCCC TCAAACTGGC AGATGCACGG
6661 TTACGATGCG CCCATCTACA CCAACGTAAC CTATCCCATT ACGGTCAATC CGCCGTTTGT
6721 TCCCACGGAG AATCCGACGG GTTGTTACTC GCTCACATTT AATGTTGATG AAAGCTGGCT
6781 ACAGGAAGGC CAGACGCGAA TTATTTTTGA TGGCGTTGGA ATT
Figure (number illegible)

His6-thioredoxin fusions in E. coli

[Annotated double-stranded sequence of the promoter and fusion region; illegible in the scan.]
pDEST4 6964 bp

Location (Base Nos.)    Gene Encoded
964..1003               Trc
1577..1453              attR1
1827..2486              CmR
2606..2690              inactivated ccdA
2828..3133              ccdB
3174..3298              attR2
3872..4777              ampR
5378..5538              ori
5778..6215              f1 ori (f1 intergenic region)
6587..704               lacIq


1 CTATCCGCTG GATGACCAGG ATGCCATTGC TGTGGAAGCT GCCTGCACTA ATGTTCCGGC
61 GTTATTTCTT GATGTCTCTG ACCAGACACC CATCAACAGT ATTATTTTCT CCCATGAAGA
121 CGGTACGCGA CTGGGCGTGG AGCATCTGGT CGCATTGGGT CACCAGCAAA TCGCGCTGTT
181 AGCGGGCCCA TTAAGTTCTG TCTCGGCGCG TCTGCGTCTG GCTGGCTGGC ATAAATATCT
241 CACTCGCAAT CAAATTCAGC CGATAGCGGA ACGGGAAGGC GACTGGAGTG CCATGTCCGG
301 TTTTCAACAA ACCATGCAAA TGCTGAATGA GGGCATCGTT CCCACTGCGA TGCTGGTTGC
361 CAACGATCAG ATGGCGCTGG GCGCAATGCG CGCCATTACC GAGTCCGGGC TGCGCGTTGG
421 TGCGGATATC TCGGTAGTGG GATACGACGA TACCGAAGAC AGCTCATGTT ATATCCCGCC
481 GTCAACCACC ATCAAACAGG ATTTTCGCCT GCTGGGGCAA ACCAGCGTGG ACCGCTTGCT
541 GCAACTCTCT CAGGGCCAGG CGGTGAAGGG CAATCAGCTG TTGCCCGTCT CACTGGTGAA
601 AAGAAAAACC ACCCTGGCAC CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC
661 ATTAATGCAG CTGGCACGAC AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA
721 TTAATGTGAG TTAGCGCGAA TTGATCTGGT TTGACAGCTT ATCATCGACT GCACGGTGCA
781 CCAATGCTTC TGGCGTCAGG CAGCCATCGG AAGCTGTGGT ATGGCTGTGC AGGTCGTAAA
841 TCACTGCATA ATTCGTGTCG CTCAAGGCGC ACTCCCGTTC TGGATAATGT TTTTTGCGCC
901 GACATCATAA CGGTTCTGGC AAATATTCTG AAATGAGCTG TTGACAATTA ATCATCCGGT
961 CCGTATAATC TGTGGAATTG TGAGCGGATA ACAATTTCAC ACAGGAAACA GACCATGGGT
1021 CATCATCATC ATCATCACGA TTACGATATC CCAACGACCG AAAACCTGTA TTTTCAGGGC
1081 GCCCATATGA GCGATAAAAT TATTCACCTG ACTGACGACA GTTTTGACAC GGATGTACTC
1141 AAAGCGGACG GGGCGATCCT CGTCGATTTC TGGGCAGAGT GGTGCGGTCC GTGCAAAATG
1201 ATCGCCCCGA TTCTGGATGA AATCGCTGAC GAATATCAGG GCAAACTGAC CGTTGCAAAA
1261 CTGAACATCG ATCAAAACCC TGGCACTGCG CCGAAATATG GCATCCGTGG TATCCCGACT
1321 CTGCTGCTGT TCAAAAACGG TGAAGTGGCG GCAACCAAAG TGGGTGCACT GTCTAAAGGT
1381 CAGTTGAAAG AGTTCCTCGA CGCTAACCTG GCCGGTTCTG GTTCTGGTGA TGACGATGAC
1441 AAGGTACCCA TCACAAGTTT GTACAAAAAA GCTGAACGAG AAACGTAAAA TGATATAAAT
1501 ATCAATATAT TAAATTAGAT TTTGCATAAA AAACAGACTA CATAATACTG TAAAACACAA
1561 CATATCCAGT CACTATGGCG GCCGCTAAGT TGGCAGCATC ACCCGACGCA CTTTGCGCCG
1621 AATAAATACC TGTGACGGAA GATCACTTCG CAGAATAAAT AAATCCTGGT GTCCCTGTTG
1681 ATACCGGGAA GCCCTGGGCC AACTTTTGGC GAAAATGAGA CGTTGATCGG CACGTAAGAG
1741 GTTCCAACTT TCACCATAAT GAAATAAGAT CACTACCGGG CGTATTTTTT GAGTTATCGA
1801 GATTTTCAGG AGCTAAGGAA GCTAAAATGG AGAAAAAAAT CACTGGATAT ACCACCGTTG
1861 ATATATCCCA ATGGCATCGT AAAGAACATT TTGAGGCATT TCAGTCAGTT GCTCAATGTA
1921 CCTATAACCA GACCGTTCAG CTGGATATTA CGGCCTTTTT AAAGACCGTA AAGAAAAATA
1981 AGCACAAGTT TTATCCGGCC TTTATTCACA TTCTTGCCCG CCTGATGAAT GCTCATCCGG
2041 AATTCCGTAT GGCAATGAAA GACGGTGAGC TGGTGATATG GGATAGTGTT CACCCTTGTT
2101 ACACCGTTTT CCATGAGCAA ACTGAAACGT TTTCATCGCT CTGGAGTGAA TACCACGACG
2161 ATTTCCGGCA GTTTCTACAC ATATATTCGC AAGATGTGGC GTGTTACGGT GAAAACCTGG
2221 CCTATTTCCC TAAAGGGTTT ATTGAGAATA TGTTTTTCGT CTCAGCCAAT CCCTGGGTGA
2281 GTTTCACCAG TTTTGATTTA AACGTGGCCA ATATGGACAA CTTCTTCGCC CCCGTTTTCA
2341 CCATGGGCAA ATATTATACG CAAGGCGACA AGGTGCTGAT GCCGCTGGCG ATTCAGGTTC
2401 ATCATGCCGT CTGTGATGGC TTCCATGTCG GCAGAATGCT TAATGAATTA CAACAGTACT
2461 GCGATGAGTG GCAGGGCGGG GCGTAAACGC GTGGATCCGG CTTACTAAAA GCCAGATAAC
2521 AGTATGCGTA TTTGCGCGCT GATTTTTGCG GTATAAGAAT ATATACTGAT ATGTATACCC
2581 GAAGTATGTC AAAAAGAGGT GTGCTATGAA GCAGCGTATT ACAGTGACAG TTGACAGCGA
2641 CAGCTATCAG TTGCTCAAGG CATATATGAT GTCAATATCT CCGGTCTGGT AAGCACAACC
2701 ATGCAGAATG AAGCCCGTCG TCTGCGTGCC GAACGCTGGA AAGCGGAAAA TCAGGAAGGG
2761 ATGGCTGAGG TCGCCCGGTT TATTGAAATG AACGGCTCTT TTGCTGACGA GAACAGGGAC
2821 TGGTGAAATG CAGTTTAAGG TTTACACCTA TAAAAGAGAG AGCCGTTATC GTCTGTTTGT
2881 GGATGTACAG AGTGATATTA TTGACACGCC CGGGCGACGG ATGGTGATCC CCCTGGCCAG
2941 TGCACGTCTG CTGTCAGATA AAGTCTCCCG TGAACTTTAC CCGGTGGTGC ATATCGGGGA
3001 TGAAAGCTGG CGCATGATGA CCACCGATAT GGCCAGTGTG CCGGTCTCCG TTATCGGGGA
3061 AGAAGTGGCT GATCTCAGCC ACCGCGAAAA TGACATCAAA AACGCCATTA ACCTGATGTT
3121 CTGGGGAATA TAAATGTCAG GCTCCCTTAT ACACAGCCAG TCTGCAGGTC GACCATAGTG
3181 ACTGGATATG TTGTGTTTTA CAGTATTATG TAGTCTGTTT TTTATGCAAA ATCTAATTTA
3241 ATATATTGAT ATTTATATCA TTTTACGTTT CTCGTTCAGC TTTCTTGTAC AAAGTGGTGA
3301 TGGGGATCCT CTAGAGTCGA CCTGCAGTAA TCGTACAGGG TAGTACAAAT AAAAAAGGCA
3361 CGTCAGATGA CGTGCCTTTT TTCTTGTGAG CAGTAAGCTT GGCTGTTTTG GCGGATGAGA
3421 GAAGATTTTC AGCCTGATAC AGATTAAATC AGAACGCAGA AGCGGTCTGA TAAAACAGAA
3481 TTTGCCTGGC GGCAGTAGCG CGGTGGTCCC ACCTGACCCC ATGCCGAACT CAGAAGTGAA
3541 ACGCCGTAGC GCCGATGGTA GTGTGGGGTC TCCCCATGCG AGAGTAGGGA ACTGCCAGGC
3601 ATCAAATAAA ACGAAAGGCT CAGTCGAAAG ACTGGGCCTT TCGTTTTATC TGTTGTTTGT
3661 CGGTGAACGC TCTCCTGAGT AGGACAAATC CGCCGGGAGC GGATTTGAAC GTTGCGAAGC
3721 AACGGCCCGG AGGGTGGCGG GCAGGACGCC CGCCATAAAC TGCCAGGCAT CAAATTAAGC
3781 AGAAGGCCAT CCTGACGGAT GGCCTTTTTG CGTTTCTACA AACTCTTTTT GTTTATTTTT
3841 CTAAATACAT TCAAATATGT ATCCGCTCAT GAGACAATAA CCCTGATAAA TGCTTCAATA
3901 ATATTGAAAA AGGAAGAGTA TGAGTATTCA ACATTTCCGT GTCGCCCTTA TTCCCTTTTT
3961 TGCGGCATTT TGCCTTCCTG TTTTTGCTCA CCCAGAAACG CTGGTGAAAG TAAAAGATGC
4021 TGAAGATCAG TTGGGTGCAC GAGTGGGTTA CATCGAACTG GATCTCAACA GCGGTAAGAT
4081 CCTTGAGAGT TTTCGCCCCG AAGAACGTTT TCCAATGATG AGCACTTTTA AAGTTCTGCT
4141 ATGTGGCGCG GTATTATCCC GTGTTGACGC CGGGCAAGAG CAACTCGGTC GCCGCATACA
4201 CTATTCTCAG AATGACTTGG TTGAGTACTC ACCAGTCACA GAAAAGCATC TTACGGATGG
4261 CATGACAGTA AGAGAATTAT GCAGTGCTGC CATAACCATG AGTGATAACA CTGCGGCCAA
4321 CTTACTTCTG ACAACGATCG GAGGACCGAA GGAGCTAACC GCTTTTTTGC ACAACATGGG
4381 GGATCATGTA ACTCGCCTTG ATCGTTGGGA ACCGGAGCTG AATGAAGCCA TACCAAACGA
4441 CGAGCGTGAC ACCACGATGC CTACAGCAAT GGCAACAACG TTGCGCAAAC TATTAACTGG
4501 CGAACTACTT ACTCTAGCTT CCCGGCAACA ATTAATAGAC TGGATGGAGG CGGATAAAGT
4561 TGCAGGACCA CTTCTGCGCT CGGCCCTTCC GGCTGGCTGG TTTATTGCTG ATAAATCTGG
4621 AGCCGGTGAG CGTGGGTCTC GCGGTATCAT TGCAGCACTG GGGCCAGATG GTAAGCCCTC
4681 CCGTATCGTA GTTATCTACA CGACGGGGAG TCAGGCAACT ATGGATGAAC GAAATAGACA
4741 GATCGCTGAG ATAGGTGCCT CACTGATTAA GCATTGGTAA CTGTCAGACC AAGTTTACTC
4801 ATATATACTT TAGATTGATT TAAAACTTCA TTTTTAATTT AAAAGGATCT AGGTGAAGAT
4861 CCTTTTTGAT AATCTCATGA CCAAAATCCC TTAACGTGAG TTTTCGTTCC ACTGAGCGTC
4921 AGACCCCGTA GAAAAGATCA AAGGATCTTC TTGAGATCCT TTTTTTCTGC GCGTAATCTG
4981 CTGCTTGCAA ACAAAAAAAC CACCGCTACC AGCGGTGGTT TGTTTGCCGG ATCAAGAGCT
5041 ACCAACTCTT TTTCCGAAGG TAACTGGCTT CAGCAGAGCG CAGATACCAA ATACTGTCCT
5101 TCTAGTGTAG CCGTAGTTAG GCCACCACTT CAAGAACTCT GTAGCACCGC CTACATACCT
5161 CGCTCTGCTA ATCCTGTTAC CAGTGGCTGC TGCCAGTGGC GATAAGTCGT GTCTTACCGG
5221 GTTGGACTCA AGACGATAGT TACCGGATAA GGCGCAGCGG TCGGGCTGAA CGGGGGGTTC
5281 GTGCACACAG CCCAGCTTGG AGCGAACGAC CTACACCGAA CTGAGATACC TACAGCGTGA
5341 GCTATGAGAA AGCGCCACGC TTCCCGAAGG GAGAAAGGCG GACAGGTATC CGGTAAGCGG
5401 CAGGGTCGGA ACAGGAGAGC GCACGAGGGA GCTTCCAGGG GGAAACGCCT GGTATCTTTA
5461 TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG
5521 GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG
5581 CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATCCCCT GATTCTGTGG ATAACCGTAT
5641 TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA ACGACCGAGC GCAGCGAGTC
5701 AGTGAGCGAG GAAGCGGAAG AGCGCCTGAT GCGGTATTTT CTCCTTACGC ATCTGTGCGG
5761 TATTTCACAC CGCATAATTT TGTTAAAATT CGCGTTAAAT TTTTGTTAAA TCAGCTCATT
5821 TTTTAACCAA TAGGCCGAAA TCGGCAAAAT CCCTTATAAA TCAAAAGAAT AGACCGAGAT
5881 AGGGTTGAGT GTTGTTCCAG TTTGGAACAA GAGTCCACTA TTAAAGAACG TGGACTCCAA
5941 CGTCAAAGGG CGAAAAACCG TCTATCAGGG CGATGGCCCA CTACGTGAAC CATCACCCTA
6001 ATCAAGTTTT TTGGGGTCGA GGTGCCGTAA AGCACTAAAT CGGAACCCTA AAGGGAGCCC
6061 CCGATTTAGA GCTTGACGGG GAAAGCCGGC GAACGTGGCG AGAAAGGAAG GGAAGAAAGC
6121 GAAAGGAGCG GGCGCTAGGG CGCTGGCAAG TGTAGCGGTC ACGCTGCGCG TAACCACCAC
6181 ACCCGCCGCG CTTAATGCGC CGCTACAGGG CGCGTCCATT CGCCATTCAG GCTGCTATGG
6241 TGCACTCTCA GTACAATCTG CTCTGATGCC GCATAGTTAA GCCAGTATAC ACTCCGCTAT
6301 CGCTACGTGA CTGGGTCATG GCTGCGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT
6361 GACGGGCTTG TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT
6421 GCATGTGTCA GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGGCAGCAG ATCAATTCGC
6481 GCGCGAAGGC GAAGCGGCAT GCATTTACGT TGACACCATC GAATGGTGCA AAACCTTTCG
6541 CGGTATGGCA TGATAGCGCC CGGAAGAGAG TCAATTCAGG GTGGTGAATG TGAAACCAGT
6601 AACGTTATAC GATGTCGCAG AGTATGCCGG TGTCTCTTAT CAGACCGTTT CCCGCGTGGT
6661 GAACCAGGCC AGCCACGTTT CTGCGAAAAC GCGGGAAAAA GTGGAAGCGG CGATGGCGGA
6721 GCTGAATTAC ATTCCCAACC GCGTGGCACA ACAACTGGCG GGCAAACAGT CGTTGCTGAT
6781 TGGCGTTGCC ACCTCCAGTC TGGCCCTGCA CGCGCCGTCG CAAATTGTCG CGGCGATTAA
6841 ATCTCGCGCC GATCAACTGG GTGCCAGCGT GGTGGTGTCG ATGGTAGAAC GAAGCGGCGT
6901 CGAAGCCTGT AAAGCGGCGG TGCACAATCT TCTCGCGCAA CGCGTCAGTN GGGCTGATCA
6961 TTAA
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attR2 

f1 (f1 intergenic region)

ampR 
1 AGGCACCCCA GGCTTTACAC TTTATGCTTC CGGCTCGTAT GTTGTGTGGA ATTGTGAGCG
61 GATAACAATT TCACACAGGA AACAGCTATG ACCATGATTA CGCCAAGCTC TAATACGACT
121 CACTATAGGG AAAGCTGGTA CGCCTGCAGG TACCGGTCCG GAATTCCCGG GTCGACGATC
181 ACAAGTTTGT ACAAAAAAGC TGAACGAGAA ACGTAAAATG ATATAAATAT CAATATATTA
241 AATTAGATTT TGCATAAAAA ACAGACTACA TAATACTGTA AAACACAACA TATCCAGTCA
301 CTATGGCGGC CGCTAAGTTG GCAGCATCAC CCGACGCACT TTGCGCCGAA TAAATACCTG
361 TGACGGAAGA TCACTTCGCA GAATAAATAA ATCCTGGTGT CCCTGTTGAT ACCGGGAAGC
421 CCTGGGCCAA CTTTTGGCGA AAATGAGACG TTGATCGGCA CGTAAGAGGT TCCAACTTTC
481 ACCATAATGA AATAAGATCA CTACCGGGCG TATTTTTTGA GTTATCGAGA TTTTCAGGAG
541 CTAAGGAAGC TAAAATGGAG AAAAAAATCA CTGGATATAC CACCGTTGAT ATATCCCAAT
601 GGCATCGTAA AGAACATTTT GAGGCATTTC AGTCAGTTGC TCAATGTACC TATAACCAGA
661 CCGTTCAGCT GGATATTACG GCCTTTTTAA AGACCGTAAA GAAAAATAAG CACAAGTTTT
721 ATCCGGCCTT TATTCACATT CTTGCCCGCC TGATGAATGC TCATCCGGAA TTCCGTATGG
781 CAATGAAAGA CGGTGAGCTG GTGATATGGG ATAGTGTTCA CCCTTGTTAC ACCGTTTTCC
841 ATGAGCAAAC TGAAACGTTT TCATCGCTCT GGAGTGAATA CCACGACGAT TTCCGGCAGT
901 TTCTACACAT ATATTCGCAA GATGTGGCGT GTTACGGTGA AAACCTGGCC TATTTCCCTA
961 AAGGGTTTAT TGAGAATATG TTTTTCGTCT CAGCCAATCC CTGGGTGAGT TTCACCAGTT
1021 TTGATTTAAA CGTGGCCAAT ATGGACAACT TCTTCGCCCC CGTTTTCACC ATGGGCAAAT
1081 ATTATACGCA AGGCGACAAG GTGCTGATGC CGCTGGCGAT TCAGGTTCAT CATGCCGTCT
1141 GTGATGGCTT CCATGTCGGC AGAATGCTTA ATGAATTACA ACAGTACTGC GATGAGTGGC
1201 AGGGCGGGGC GTAAACGCGT GGATCCGGCT TACTAAAAGC CAGATAACAG TATGCGTATT
1261 TGCGCGCTGA TTTTTGCGGT ATAAGAATAT ATACTGATAT GTATACCCGA AGTATGTCAA
1321 AAAGAGGTGT GCTATGAAGC AGCGTATTAC AGTGACAGTT GACAGCGACA GCTATCAGTT
1381 GCTCAAGGCA TATATGATGT CAATATCTCC GGTCTGGTAA GCACAACCAT GCAGAATGAA
1441 GCCCGTCGTC TGCGTGCCGA ACGCTGGAAA GCGGAAAATC AGGAAGGGAT GGCTGAGGTC
1501 GCCCGGTTTA TTGAAATGAA CGGCTCTTTT GCTGACGAGA ACAGGGACTG GTGAAATGCA
1561 GTTTAAGGTT TACACCTATA AAAGAGAGAG CCGTTATCGT CTGTTTGTGG ATGTACAGAG
1621 TGATATTATT GACACGCCCG GGCGACGGAT GGTGATCCCC CTGGCCAGTG CACGTCTGCT
1681 GTCAGATAAA GTCTCCCGTG AACTTTACCC GGTGGTGCAT ATCGGGGATG AAAGCTGGCG
1741 CATGATGACC ACCGATATGG CCAGTGTGCC GGTCTCCGTT ATCGGGGAAG AAGTGGCTGA
1801 TCTCAGCCAC CGCGAAAATG ACATCAAAAA CGCCATTAAC CTGATGTTCT GGGGAATATA
1861 AATGTCAGGC TCCCTTATAC ACAGCCAGTC TGCAGGTCGA CCATAGTGAC TGGATATGTT
1921 GTGTTTTACA GTATTATGTA GTCTGTTTTT TATGCAAAAT CTAATTTAAT ATATTGATAT
1981 TTATATCATT TTACGTTTCT CGTTCAGCTT TCTTGTACAA AGTGGTGATC ACTAGTCGGC
2041 GGCCGCTCTA GAGGATCCAA GCTTACGTAC GCGTGCATGC GACGTCATAG CTCTTCTATA
2101 GTGTCACCTA AATTCAATTC ACTGGCCGTC GTTTTACAAC GTCGTGACTG GGAAAACCCT
2161 GGCGTTACCC AACTTAATCG CCTTGCAGCA CATCCCCCTT TCGCCAGCTG GCGTAATAGC
2221 GAAGAGGCCC GCACCGATCG CCCTTCCCAA CAGTTGCGCA GCCTGAATGG CGAATGGACG
2281 CGCCCTGTAG CGGCGCATTA AGCGCGGCGG GTGTGGTGGT TACGCGCAGC GTGACCGCTA
2341 CACTTGCCAG CGCCCTAGCG CCCGCTCCTT TCGCTTTCTT CCCTTCCTTT CTCGCCACGT
2401 TCGCCGGCTT TCCCCGTCAA GCTCTAAATC GGGGGCTCCC TTTAGGGTTC CGATTTAGTG
2461 CTTTACGGCA CCTCGACCCC AAAAAACTTG ATTAGGGTGA TGGTTCACGT AGTGGGCCAT
2521 CGCCCTGATA GACGGTTTTT CGCCCTTTGA CGTTGGAGTC CACGTTCTTT AATAGTGGAC
2581 TCTTGTTCCA AACTGGAACA ACACTCAACC CTATCTCGGT CTATTCTTTT GATTTATAAG
2641 GGATTTTGCC GATTTCGGCC TATTGGTTAA AAAATGAGCT GATTTAACAA AAATTTAACG
2701 CGAATTTTAA CAAAATATTA ACGTTTACAA TTTCAGGTGG CACTTTTCGG GGAAATGTGC
2761 GCGGAACCCC TATTTGTTTA TTTTTCTAAA TACATTCAAA TATGTATCCG CTCATGAGAC
2821 AATAACCCTG ATAAATGCTT CAATAATATT GAAAAAGGAA GAGTATGAGT ATTCAACATT
2881 TCCGTGTCGC CCTTATTCCC TTTTTTGCGG CATTTTGCCT TCCTGTTTTT GCTCACCCAG
2941 AAACGCTGGT GAAAGTAAAA GATGCTGAAG ATCAGTTGGG TGCACGAGTG GGTTACATCG
3001 AACTGGATCT CAACAGCGGT AAGATCCTTG AGAGTTTTCG CCCCGAAGAA CGTTTTCCAA
3061 TGATGAGCAC TTTTAAAGTT CTGCTATGTG GCGCGGTATT ATCCCGTATT GACGCCGGGC
3121 AAGAGCAACT CGGTCGCCGC ATACACTATT CTCAGAATGA CTTGGTTGAG TACTCACCAG
3181 TCACAGAAAA GCATCTTACG GATGGCATGA CAGTAAGAGA ATTATGCAGT GCTGCCATAA
3241 CCATGAGTGA TAACACTGCG GCCAACTTAC TTCTGACAAC GATCGGAGGA CCGAAGGAGC
3301 TAACCGCTTT TTTGCACAAC ATGGGGGATC ATGTAACTCG CCTTGATCGT TGGGAACCGG
3361 AGCTGAATGA AGCCATACCA AACGACGAGC GTGACACCAC GATGCCTGTA GCAATGGCAA
3421 CAACGTTGCG CAAACTATTA ACTGGCGAAC TACTTACTCT AGCTTCCCGG CAACAATTAA
3481 TAGACTGGAT GGAGGCGGAT AAAGTTGCAG GACCACTTCT GCGCTCGGCC CTTCCGGCTG
3541 GCTGGTTTAT TGCTGATAAA TCTGGAGCCG GTGAGCGTGG GTCTCGCGGT ATCATTGCAG
3601 CACTGGGGCC AGATGGTAAG CCCTCCCGTA TCGTAGTTAT CTACACGACG GGGAGTCAGG
3661 CAACTATGGA TGAACGAAAT AGACAGATCG CTGAGATAGG TGCCTCACTG ATTAAGCATT
3721 GGTAACTGTC AGACCAAGTT TACTCATATA TACTTTAGAT TGATTTAAAA CTTCATTTTT
3781 AATTTAAAAG GATCTAGGTG AAGATCCTTT TTGATAATCT CATGACCAAA ATCCCTTAAC
3841 GTGAGTTTTC GTTCCACTGA GCGTCAGACC CCGTAGAAAA GATCAAAGGA TCTTCTTGAG
3901 ATCCTTTTTT TCTGCGCGTA ATCTGCTGCT TGCAAACAAA AAAACCACCG CTACCAGCGG
3961 TGGTTTGTTT GCCGGATCAA GAGCTACCAA CTCTTTTTCC GAAGGTAACT GGCTTCAGCA
4021 GAGCGCAGAT ACCAAATACT GTCCTTCTAG TGTAGCCGTA GTTAGGCCAC CACTTCAAGA
4081 ACTCTGTAGC ACCGCCTACA TACCTCGCTC TGCTAATCCT GTTACCAGTG GCTGCTGCCA
4141 GTGGCGATAA GTCGTGTCTT ACCGGGTTGG ACTCAAGACG ATAGTTACCG GATAAGGCGC
4201 AGCGGTCGGG CTGAACGGGG GGTTCGTGCA CACAGCCCAG CTTGGAGCGA ACGACCTACA
4261 CCGAACTGAG ATACCTACAG CGTGAGCATT GAGAAAGCGC CACGCTTCCC GAAGGGAGAA
4321 AGGCGGACAG GTATCCGGTA AGCGGCAGGG TCGGAACAGG AGAGCGCACG AGGGAGCTTC
4381 CAGGGGGAAA CGCCTGGTAT CTTTATAGTC CTGTCGGGTT TCGCCACCTC TGACTTGAGC
4441 GTCGATTTTT GTGATGCTCG TCAGGGGGGC GGAGCCTATG GAAAAACGCC AGCAACGCGG
4501 CCTTTTTACG GTTCCTGGCC TTTTGCTGGC CTTTTGCTCA CATGTTCTTT CCTGCGTTAT
4561 CCCCTGATTC TGTGGATAAC CGTATTACCG CCTTTGAGTG AGCTGATACC GCTCGCCGCA
4621 GCCGAACGAC CGAGCGCAGC GAGTCAGTGA GCGAGGAAGC GGAAGAGCGC CCAATACGCA
4681 AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG AGCTTGCAAT TCGCGCGCGA
4741 AGGCGAAGCG GCATTTACGT TGACACCATC GAATGGCGCA AAACCTTTCG CGGTATGGCA
4801 TGATAGCGCC CGGAAGAGAG TCAATTCAGG GTGGTGAATG TGAAACCAGT AACGTTATAC
4861 GATGTCGCAG AGTATGCCGG TGTCTCTTAT CAGACCGTTT CCCGCGTGGT GAACCAGGCC
4921 AGCCACGTTT CTGCGAAAAC GCGGGAAAAA GTGGAAGCGG CGATGGCGGA GCTGAATTAC
4981 ATTCCCAACC GCGTGGCACA ACAACTGGCG GGCAAACAGT CGTTGCTGAT TGGCGTTGCC
5041 ACCTCCAGTC TGGCCCTGCA CGCGCCGTCG CAAATTGTCG CGGCGATTAA ATCTCGCGCC
5101 GATCAACTGG GTGCCAGCGT GGTGGTGTCG ATGGTAGAAC GAAGCGGCGT CGAAGCCTGT
5161 AAAGCGGCGG TGCACAATCT TCTCGCGCAA CGGGTCAGTG GGCTGATCAT TAACTATCCG
5221 CTGGATGACC AGGATGCCAT TGCTGTGGAA GCTGCCTGCA CTAATGTTCC GGCGTTATTT
5281 CTTGATGTCT CTGACCAGAC ACCCATCAAC AGTATTATTT TCTCCCATGA AGACGGTACG
5341 CGACTGGGCG TGGAGCATCT GGTCGCATTG GGTCACCAGC AAATCGCGCT GTTAGCGGGC
5401 CCATTAAGTT CTGTCTCGGC GCGTCTGCGT CTGGCTGGCT GGCATAAATA TCTCACTCGC
5461 AATCAAATTC AGCCGATAGC GGAACGGGAA GGCGACTGGA GTGCCATGTC CGGTTTTCAA
5521 CAAACCATGC AAATGCTGAA TGAGGGCATC GTTCCCACTG CGATGCTGGT TGCCAACGAT
5581 CAGATGGCGC TGGGCGCAAT GCGCGCCATT ACCGAGTCCG GGCTGCGCGT TGGTGCGGAT
5641 ATCTCGGTAG TGGGATACGA CGATACCGAA GACAGCTCAT GTTATATCCC GCCGTCAACC
5701 ACCATCAAAC AGGATTTTCG CCTGCTGGGG CAAACCAGCG TGGACCGCTT GCTGCAACTC
5761 TCTCAGGGCC AGGCGGTGAA GGGCAATCAG CTGTTGCCCG TCTCACTGGT GAAAAGAAAA
5821 ACCACCCTGG CGCCCAATAC GCAAACCGCC TCTCCCCGCG CGTTGGCCGA TTCATTAATG
5881 CAGCTGGCAC GACAGGTTTC CCGACTGGAA AGCGGGCAGT GAGCGCAACG CAATTAATGT
5941 GAGTTAGCTC ACTCATT



WO 00/52027 PCT/US00/05432



Figure (number illegible): pSPORT (opposite strand)

[Annotated double-stranded sequence of the cloning region, figure positions 1 to approximately 2143; illegible in the scan.]






Figure (number illegible) (cont'd)

[Plasmid map, 5957 bps, showing attR1 and the following restriction sites: XmnI, ScaI, BsaI, AhdI, AlwNI, NgoMIV, NaeI, DraIII, BspLU11I, PflMI, BbeI, EheI, NarI, Esp3I, EcoRV, ApaI, Bsp120I, BstEII, AatII, SphI, MluI, SplI, SnaBI, HindIII, BamHI, XbaI, NotI, XmaIII, SpeI, AccI, SalI, AvaI, SmaI, XmaI, EcoRI, BspEI, RsrII, AgeI, Asp718I, KpnI, BspMI, Sse8387I, PstI.]






pDEST6 5957 bp

Location (Base Nos.)    Gene Encoded
266..142                attR1
516..1175               CmR
1295..1379              inactivated ccdA
1517..1822              ccdB
1863..1987              attR2
2203..3369              lacI
4403..5260              ampR
5392..5847              (gene label missing in source)
1 TAACGCCAGG GTTTTCCCAG TCACGACGTT GTAAAACGAC GGCCAGTGAA TTGAATTTAG
61 GTGACACTAT AGAAGAGCTA TGACGTCGCA TGCACGCGTA CGTAAGCTTG GATCCTCTAG
121 AGCGGCCGCC GACTAGTGAT CACAAGTTTG TACAAAAAAG CTGAACGAGA AACGTAAAAT
181 GATATAAATA TCAATATATT AAATTAGATT TTGCATAAAA AACAGACTAC ATAATACTGT
241 AAAACACAAC ATATCCAGTC ACTATGGCGG CCGCTAAGTT GGCAGCATCA CCCGACGCAC
301 TTTGCGCCGA ATAAATACCT GTGACGGAAG ATCACTTCGC AGAATAAATA AATCCTGGTG
361 TCCCTGTTGA TACCGGGAAG CCCTGGGCCA ACTTTTGGCG AAAATGAGAC GTTGATCGGC
421 ACGTAAGAGG TTCCAACTTT CACCATAATG AAATAAGATC ACTACCGGGC GTATTTTTTG
481 AGTTATCGAG ATTTTCAGGA GCTAAGGAAG CTAAAATGGA GAAAAAAATC ACTGGATATA
541 CCACCGTTGA TATATCCCAA TGGCATCGTA AAGAACATTT TGAGGCATTT CAGTCAGTTG
601 CTCAATGTAC CTATAACCAG ACCGTTCAGC TGGATATTAC GGCCTTTTTA AAGACCGTAA
661 AGAAAAATAA GCACAAGTTT TATCCGGCCT TTATTCACAT TCTTGCCCGC CTGATGAATG
721 CTCATCCGGA ATTCCGTATG GCAATGAAAG ACGGTGAGCT GGTGATATGG GATAGTGTTC
781 ACCCTTGTTA CACCGTTTTC CATGAGCAAA CTGAAACGTT TTCATCGCTC TGGAGTGAAT
841 ACCACGACGA TTTCCGGCAG TTTCTACACA TATATTCGCA AGATGTGGCG TGTTACGGTG
901 AAAACCTGGC CTATTTCCCT AAAGGGTTTA TTGAGAATAT GTTTTTCGTC TCAGCCAATC
961 CCTGGGTGAG TTTCACCAGT TTTGATTTAA ACGTGGCCAA TATGGACAAC TTCTTCGCCC
1021 CCGTTTTCAC CATGGGCAAA TATTATACGC AAGGCGACAA GGTGCTGATG CCGCTGGCGA
1081 TTCAGGTTCA TCATGCCGTC TGTGATGGCT TCCATGTCGG CAGAATGCTT AATGAATTAC
1141 AACAGTACTG CGATGAGTGG CAGGGCGGGG CGTAAACGCG TGGATCCGGC TTACTAAAAG
1201 CCAGATAACA GTATGCGTAT TTGCGCGCTG ATTTTTGCGG TATAAGAATA TATACTGATA
1261 TGTATACCCG AAGTATGTCA AAAAGAGGTG TGCTATGAAG CAGCGTATTA CAGTGACAGT
1321 TGACAGCGAC AGCTATCAGT TGCTCAAGGC ATATATGATG TCAATATCTC CGGTCTGGTA
1381 AGCACAACCA TGCAGAATGA AGCCCGTCGT CTGCGTGCCG AACGCTGGAA AGCGGAAAAT
1441 CAGGAAGGGA TGGCTGAGGT CGCCCGGTTT ATTGAAATGA ACGGCTCTTT TGCTGACGAG
1501 AACAGGGACT GGTGAAATGC AGTTTAAGGT TTACACCTAT AAAAGAGAGA GCCGTTATCG
1561 TCTGTTTGTG GATGTACAGA GTGATATTAT TGACACGCCC GGGCGACGGA TGGTGATCCC
1621 CCTGGCCAGT GCACGTCTGC TGTCAGATAA AGTCTCCCGT GAACTTTACC CGGTGGTGCA
1681 TATCGGGGAT GAAAGCTGGC GCATGATGAC CACCGATATG GCCAGTGTGC CGGTCTCCGT
1741 TATCGGGGAA GAAGTGGCTG ATCTCAGCCA CCGCGAAAAT GACATCAAAA ACGCCATTAA
1801 CCTGATGTTC TGGGGAATAT AAATGTCAGG CTCCCTTATA CACAGCCAGT CTGCAGGTCG
1861 ACCATAGTGA CTGGATATGT TGTGTTTTAC AGTATTATGT AGTCTGTTTT TTATGCAAAA
1921 TCTAATTTAA TATATTGATA TTTATATCAT TTTACGTTTC TCGTTCAGCT TTCTTGTACA
1981 AAGTGGTGAT CGTCGACCCG GGAATTCCGG ACCGGTACCT GCAGGCGTAC CAGCTTTCCC
2041 TATAGTGAGT CGTATTAGAG CTTGGCGTAA TCATGGTCAT AGCTGTTTCC TGTGTGAAAT
2101 TGTTATCCGC TCACAATTCC ACACAACATA CGAGCCGGAA GCATAAAGTG TAAAGCCTGG
2161 GGTGCCTAAT GAGTGAGCTA ACTCACATTA ATTGCGTTGC GCTCACTGCC CGCTTTCCAG
2221 TCGGGAAACC TGTCGTGCCA GCTGCATTAA TGAATCGGCC AACGCGCGGG GAGAGGCGGT
2281 TTGCGTATTG GGCGCCAGGG TGGTTTTTCT TTTCACCAGT GAGACGGGCA ACAGCTGATT
2341 GCCCTTCACC GCCTGGCCCT GAGAGAGTTG CAGCAAGCGG TCCACGCTGG TTTGCCCCAG
2401 CAGGCGAAAA TCCTGTTTGA TGGTGGTTGA CGGCGGGATA TAACATGAGC TGTCTTCGGT
2461 ATCGTCGTAT CCCACTACCG AGATATCCGC ACCAACGCGC AGCCCGGACT CGGTAATGGC
2521 GCGCATTGCG CCCAGCGCCA TCTGATCGTT GGCAACCAGC ATCGCAGTGG GAACGATGCC
2581 CTCATTCAGC ATTTGCATGG TTTGTTGAAA ACCGGACATG GCACTCCAGT CGCCTTCCCG
2641 TTCCGCTATC GGCTGAATTT GATTGCGAGT GAGATATTTA TGCCAGCCAG CCAGACGCAG
2701 ACGCGCCGAG ACAGAACTTA ATGGGCCCGC TAACAGCGCG ATTTGCTGGT GACCCAATGC
2761 GACCAGATGC TCCACGCCCA GTCGCGTACC GTCTTCATGG GAGAAAATAA TACTGTTGAT
2821 GGGTGTCTGG TCAGAGACAT CAAGAAATAA CGCCGGAACA TTAGTGCAGG CAGCTTCCAC
2881 AGCAATGGCA TCCTGGTCAT CCAGCGGATA GTTAATGATC AGCCCACTGA CCCGTTGCGC
2941 GAGAAGATTG TGCACCGCCG CTTTACAGGC TTCGACGCCG CTTCGTTCTA CCATCGACAC
3001 CACCACGCTG GCACCCAGTT GATCGGCGCG AGATTTAATC GCCGCGACAA TTTGCGACGG
3061 CGCGTGCAGG GCCAGACTGG AGGTGGCAAC GCCAATCAGC AACGACTGTT TGCCCGCCAG
3121 TTGTTGTGCC ACGCGGTTGG GAATGTAATT CAGCTCCGCC ATCGCCGCTT CCACTTTTTC
3181 CCGCGTTTTC GCAGAAACGT GGCTGGCCTG GTTCACCACG CGGGAAACGG TCTGATAAGA
3241 GACACCGGCA TACTCTGCGA CATCGTATAA CGTTACTGGT TTCACATTCA CCACCCTGAA
3301 TTGACTCTCT TCCGGGCGCT ATCATGCCAT ACCGCGAAAG GTTTTGCGCC ATTCGATGGT
3361 GTCAACGTAA ATGCCGCTTC GCCTTCGCGC GCGAATTGCA AGCTCTGCAT TAATGAATCG
3421 GCCAACGCGC GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC TTCCGCTTCC TCGCTCACTG
3481 ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC GAGCGGTATC AGCTCACTCA AAGGCGGTAA
3541 TACGGTTATC CACAGAATCA GGGGATAACG CAGGAAAGAA CATGTGAGCA AAAGGCCAGC
3601 AAAAGGCCAG GAACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG CTCCGCCCCC
3661 CTGACGAGCA TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG ACAGGACTAT
3721 AAAGATACCA GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT CCGACCCTGC
3781 CGCTTACCGG ATACCTGTCC GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT TCTCAATGCT
3841 CACGCTGTAG GTATCTCAGT TCGGTGTAGG TCGTTCGCTC CAAGCTGGGC TGTGTGCACG
3901 AACCCCCCGT TCAGCCCGAC CGCTGCGCCT TATCCGGTAA CTATCGTCTT GAGTCCAACC
3961 CGGTAAGACA CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT AGCAGAGCGA
4021 GGTATGTAGG CGGTGCTACA GAGTTCTTGA AGTGGTGGCC TAACTACGGC TACACTAGAA
4081 GGACAGTATT TGGTATCTGC GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA
4141 GCTCTTGATC CGGCAAACAA ACCACCGCTG GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC
4201 AGATTACGCG CAGAAAAAAA GGATCTCAAG AAGATCCTTT GATCTTTTCT ACGGGGTCTG
4261 ACGCTCAGTG GAACGAAAAC TCACGTTAAG GGATTTTGGT CATGAGATTA TCAAAAAGGA
4321 TCTTCACCTA GATCCTTTTA AATTAAAAAT GAAGTTTTAA ATCAATCTAA AGTATATATG
4381 AGTAAACTTG GTCTGACAGT TACCAATGCT TAATCAGTGA GGCACCTATC TCAGCGATCT
4441 GTCTATTTCG TTCATCCATA GTTGCCTGAC TCCCCGTCGT GTAGATAACT ACGATACGGG
4501 AGGGCTTACC ATCTGGCCCC AGTGCTGCAA TGATACCGCG AGACCCACGC TCACCGGCTC
4561 CAGATTTATC AGCAATAAAC CAGCCAGCCG GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA
4621 CTTTATCCGC CTCCATCCAG TCTATTAATT GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC
4681 CAGTTAATAG TTTGCGCAAC GTTGTTGCCA TTGCTACAGG CATCGTGGTG TCACGCTCGT
4741 CGTTTGGTAT GGCTTCATTC AGCTCCGGTT CCCAACGATC AAGGCGAGTT ACATGATCCC
4801 CCATGTTGTG CAAAAAAGCG GTTAGCTCCT TCGGTCCTCC GATCGTTGTC AGAAGTAAGT
4861 TGGCCGCAGT GTTATCACTC ATGGTTATGG CAGCACTGCA TAATTCTCTT ACTGTCATGC
4921 CATCCGTAAG ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC TGAGAATAGT
4981 GTATGCGGCG ACCGAGTTGC TCTTGCCCGG CGTCAATACG GGATAATACC GCGCCACATA
5041 GCAGAACTTT AAAAGTGCTC ATCATTGGAA AACGTTCTTC GGGGCGAAAA CTCTCAAGGA
5101 TCTTACCGCT GTTGAGATCC AGTTCGATGT AACCCACTCG TGCACCCAAC TGATCTTCAG
5161 CATCTTTTAC TTTCACCAGC GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA AATGCCGCAA
5221 AAAAGGGAAT AAGGGCGACA CGGAAATGTT GAATACTCAT ACTCTTCCTT TTTCAATATT
5281 ATTGAAGCAT TTATCAGGGT TATTGTCTCA TGAGCGGATA CATATTTGAA TGTATTTAGA
5341 AAAATAAACA AATAGGGGTT CCGCGCACAT TTCCCCGAAA AGTGCCACCT GAAATTGTAA
5401 ACGTTAATAT TTTGTTAAAA TTCGCGTTAA ATTTTTGTTA AATCAGCTCA TTTTTTAACC
5461 AATAGGCCGA AATCGGCAAA ATCCCTTATA AATCAAAAGA ATAGACCGAG ATAGGGTTGA
5521 GTGTTGTTCC AGTTTGGAAC AAGAGTCCAC TATTAAAGAA CGTGGACTCC AACGTCAAAG
5581 GGCGAAAAAC CGTCTATCAG GGCGATGGCC CACTACGTGA ACCATCACCC TAATCAAGTT
5641 TTTTGGGGTC GAGGTGCCGT AAAGCACTAA ATCGGAACCC TAAAGGGAGC CCCCGATTTA
5701 GAGCTTGACG GGGAAAGCCG GCGAACGTGG CGAGAAAGGA AGGGAAGAAA GCGAAAGGAG
5761 CGGGCGCTAG GGCGCTGGCA AGTGTAGCGG TCACGCTGCG CGTAACCACC ACACCCGCCG
5821 CGCTTAATGC GCCGCTACAG GGCGCGTCCA TTCGCCATTC AGGCTGCGCA ACTGTTGGGA
5881 AGGGCGATCG GTGCGGGCCT CTTCGCTATT ACGCCAGCTG GCGAAAGGGG GATGTGCTGC
5941 AAGGCGATTA AGTTGGG
CMV promoter for eukaryotic expression

[Annotated double-stranded sequence around the CMV promoter, figure positions 970 to approximately 1225; illegible in the scan. Restriction sites labeled include NsiI, SnaBI, MunI and HpaI.]
pDEST7 6025 bp (rotated to position 2800)

Location (Base Nos.)    Gene Encoded
67..589                 CMV promoter
906..782                attR1
1015..1674              CmR
1794..1878              inactivated ccdA
2016..2321              ccdB
2362..2486              attR2
2671..3033              small t & polyA
3227..3502              f1
3962..4822              ampR
5022..5661              ori



1 ATTATCATGA CATTAACCTA TAAAAATAGG CGTAGTACGA GGCCCTTTCA CTCATTAGAT 
61 GCATGTCGTT ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCGCCCA ACGACCCCCG 
121 CCCATTGACG TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA CTTTC CATTG 
181 ACGTCAATGG GTGGAGTATT TACGGTAAAC TGCCCACTTG GCAGTACATC AAGTGTATCA 
241 TATGCCAAGT ACGCCCCCTA TTGACGTCAA TGACGGTAAA TGGCCCGCCT GGCATTATGC 
3 01 CCAGTACATG ACCTTATGGG ACTTTCCTAC TTGGCAGTAC ATCTACGTAT TAGTCATCGC 
361 TATTAC C ATG GTGATGCGGT TTTGGCAGTA CATCAATGGG CGTGGATAG C GGTTTGACTC 
421 ACGGGGATTT CCAAGTCTCC ACCCCATTGA CGTCAATGGG AGTTTGTTTT GGCACCAAAA 
481 TCAACGGGAC TTTCCAAAAT GTCGTAACAA CTCCGCCCCA TTG AC G C AAA TGGGCGGTAG 
541 GCGTGTACGG TGGGAGGTCT ATATAAG C AG AGCTCGTTTA GTGAACCGTC AGATCGCCTG 
6 01 GAG AC G C CAT CCACGCTGTT TTGACCTCCA TAGAAGACAC CGGGACCGAT CCAGCCTCCG 
661 GACTCTAGCC TAGGCCGCGG AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATT 
721 AGGCCTTTGC AAAAAG C TAT TTAGGTGACA CTATAGAAGG TACGCCTGCA GGTACCGGAT 
781 CACAAGTTTG TACAAAAAAG CTGAACGAGA AACGTAAAAT GATATAAATA TCAATATATT 
841 AAATTAGATT TTG C ATAAAA AACAGACTAC ATAATACTGT AAAACACAAC ATATCCAGTC 
901 ACTATGGCGG CCGCATTAGG CACCCCAGGC TTTACACTTT ATGCTTCCGG CTCGTATAAT 
961 GTGTGGATTT TGAGTTAGGA TCCGTCGAGA TTTTCAGGAG CTAAGGAAG C T AAAAT G GAG 
1021 AAAAAAATCA C TG GAT ATA C CACCGTTGAT ATATCCCAAT GGCATCGTAA AGAACATTTT 
1081 GAGGCATTTC AGTCAGTTGC TCAATGTACC TATAACCAGA CCGTTCAGCT GGATATTACG 
1141 GCCTTTTTAA AG AC C G T AAA GAAAAATAAG CACAAGTTTT ATCCGGCCTT TATTCACATT 
12 01 CTTGCCCGCC TGATGAATGC TCATCCGGAA TTCCGTATGG CAATGAAAGA CGGTGAGCTG 

12 61 GTGATATGGG ATAGTGTTCA CCCTTGTTAC ACCGTTTTCC ATGAGCAAAC TGAAACGTTT 
1321 TCATCGCTCT GGAGTGAATA C C AC G AC GAT TTCCGGCAGT TTCTACACAT AT ATT C G C AA 

13 81 GATGTGGCGT GTTACGGTGA AAACCTGGCC TATTTCCCTA AAGGGTTTAT TGAGAATATG 

14 41 TTTTTCGTCT CAGCCAATCC CTGGGTGAGT TTCACCAGTT TTGATTTAAA CGTGGCCAAT 
1501 ATGGACAACT ' TCTTCGCCCC CGTTTTCACC ATGGGCAAAT ATTATACGCA AGGCGACAAG 
1561 GTGCTGATGC CGCTGGCGAT TCAGGTTCAT CATGCCGTCT GTGATGGCTT CCATGTCGGC 
1621 AGAATGCTTA ATGAATTACA AC AGTACTG C GATGAGTGGC AGGGCGGGGC GTAAACGCGT 
16 81 GGATCCGGCT TACTAAAAGC CAGATAACAG TATGCGTATT TGCGCGCTGA TTTTTGCGGT 
1741 ATAAGAATAT ATACTGATAT GTATACCCGA AGTATGTCAA AAAGAGGTGT GCTATGAAGC 
18 01 AG CGTATTAC AGTGACAGTT G AC AG C G AC A GCTATCAGTT GCTCAAGGCA TATATGATGT 
18 61 CAATATCTCC GGTCTGGTAA G C AC AAC CAT GCAGAATGAA GCCCGTCGTC TGCGTGCCGA 
1921 ACGCTGGAAA GCGGAAAATC AGGAAGGGAT GGCTGAGGTC GCCCGGTTTA TTGAAATGAA 
1981 CGGCTCTTTT GCTGACGAGA ACAGGGACTG GTGAAATGCA GTTTAAGGTT TACACCTATA 
2 041 AAAGAGAGAG CCGTTATCGT CTGTTTGTGG ATGTACAGAG TGATATTATT GACACGCCCG 
2101 GGCGACGGAT GGTGATCCCC CTGGCCAGTG CACGTCTGCT GTCAGATAAA GTCTCCCGTG 
2161 AACTTTACCC GGTGGTGCAT ATCGGGGATG AAAGCTGGCG C ATG ATG AC C AC CG ATATGG 
2221 CCAGTGTGCC GGTCTCCGTT ATCGGGGAAG AAGTGGCTGA TCTCAGCCAC CGCGAAAATG 
2281 ACATCAAAAA CG C CATTAAC CTGATGTTCT GGGGAATATA AATGTCAGGC TCCCTTATAC 
2 341 ACAGCCAGTC TGCAGGTCGA C CAT AG T G AC T G GAT ATG TT GTGTTTTACA GTATTATGTA 
24 01 GTCTGTTTTT TATG C AAAAT CTAATTTAAT ATATTGATAT T TAT AT CAT T TTACGTTTCT 
24 61 CGTTCAGCTT TCTTGTACAA AGTGGTGATC GCGTGCATGC GACGTCATAG CTCTCTCCCT 
2521 ATAGTGAGTC GTATTATAAG CTAGGCACTG GCCGTCGTTT TACAACGTCG TGACTGGGAA
2581 AACTGCTAGC TTGGGATCTT TGTGAAGGAA CCTTACTTCT GTGGTGTGAC ATAATTGGAC
2 641 AAACTACCTA CAGAGATTTA AAGCTCTAAG GTAAATATAA AATTTTTAAG TGTATAATGT 
2 701 GTTAAACTAG CTGCATATGC TTGCTGCTTG AG AG TTTTG C TTACTGAGTA TGATTTATGA 
2 761 AAATATTATA CACAGGAGCT AGTGATTCTA ATTG TTTGTG TATTTTAGAT TCACAGTCCC 
2 821 AAGGCTCATT TCAGGCCCCT CAGTCCTCAC AGTCTGTTCA TGATCATAAT CAGCCATACC 
2 881 ACATTTGTAG AGGTTTTACT TGCTTTAAAA AACCTCCCAC ACCTCCCCCT GAAC CTGAAA 

2 941 CATAAAATGA ATGCAATTGT TGTTGTTAAC TTGTTTATTG C AG CTTATAA TGGTTACAAA 

3 001 TAAAGCAATA GCATCACAAA TTTCACAAAT AAAGCATTTT TTTCACTGCA TTCTAGTTGT 
3 061 GGTTTGTCCA AACTCATCAA TGTATCTTAT CATGTCTGGA TCGATCCTGC ATTAATGAAT 
3121 CGGCCAACGC GCGGGGAGAG GCGGTTTGCG TATTGGCTGG CGTAATAGCG AAGAGGCCCG 
3181 CACCGATCGC CCTTCCCAAC AGTTGCGCAG CCTGAATGGC GAATGGGACG CGCCCTGTAG 
3 241 CGGCGCATTA AGCGCGGCGG GTGTGGTGGT TACGCGCAGC GTGACCGCTA CACTTGCCAG 
3 3 01 CGCCCTAGCG CCCGCTCCTT TCGCTTTCTT CCCTTCCTTT CTCGCCACGT TCGCCGGCTT 
3 361 TCCCCGTCAA GCTCTAAATC GGGGGCTCCC TTTAGGGTTC CGATTTAGTG CTTTACGGCA 
3 421 CCTCGACCCC AAAAAACTTG ATTAGGGTGA TGGTTCACGT AGTGGGCCAT CGCCCTGATA 
3 481 GACGGTTTTT CGCCCTTTGA CGTTGGAGTC CACGTTCTTT AATAGTGGAC TCTTGTTCCA 
3 541 AACTGGAACA ACACTCAACC CTATCTCGGT CTATTCTTTT GATTTATAAG GGATTTTGCC 
3 6 01 GATTTCGGCC TATTG GTTAA AAAATG AG C T GATTTAACAA AAATTTAACG CGAATTTTAA 
3 661 CAAAATATTA ACGTTTACAA TTTCAGGTGG CACTTTTCGG GGAAATGTGC GCGGAACCCC 
3 721 TATTTGTTTA TTTTTCTAAA TAC ATT C AAA TATGTATCCG CTCATGCCAG GTCTTGGACT 
3 781 GGTGAGAACG GCTTGCTCGG CAGCTTCGAT GTGTGCTGGA GGGAGAATAA AGGTCTAAGA 
3 841 TGTGCGATAG AGGGAAGTCG CATTGAATTA TGTG CTGTGT AGGGATCGCT GGTATCAAAT 
3 901 ATGTGTGCCC ACCCCTGGCA TGAGACAATA ACCCTGATAA ATGCTTCAAT AATATTGAAA 

3 961 AAGGAAGAGT ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 

4 021 TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG CTGAAGATCA 
4 081 GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGAT CTCAAC AGCGGTAAGA TCCTTGAGAG 
4141 TTTTCGCCCC GAAGAACGTT TTC CAATGAT GAG C ACTTTT AAAGTTCTGC TATGTGGCGC 
42 01 GGTATTATCC CGTATTGACG CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA 
42 61 GAATGACTTG GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 
4 321 AAGAGAATTA TGCAGTGCTG C C ATAAC CAT GAGTGATAAC ACTGCGGCCA ACTTACTTCT 
4 381 GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG CACAACATGG GGGATCATGT 
4 441 AACTCGCCTT GATCGTTGGG AACCGGAGCT GAATGAAGCC ATAC CAAACG ACGAGCGTGA 
4 501 CACCACGATG CCTGTAGCAA TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT 
4 561 TACTCTAGCT TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 
4 621 ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG GAGCCGGTGA 
4 681 GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT GGTAAGCCCT CCCGTATCGT 
4 741 AGTTATCTAC ACGACGGGGA GTCAGGCAAC TATGGATGAA CGAAATAGAC AGATCGCTGA 
48 01 GATAGGTGCC TCACTGATTA AG C ATTGGTA ACTGTCAGAC CAAGTTTACT CATATATACT 
4 861 TTAGATTGAT TTAAAACTTC ATTTTTAATT TAAAAGG AT C TAGGTGAAGA TCCTTTTTGA 
4 921 TAATCTCATG CCATAACTTC GTATAATGTA TGCTATACGA AGTTATGGCA TGACCAAAAT 

4 981 CCCTTAACGT GAGTTTTCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC 

5 041 TTCTTGAGAT CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT 
5101 ACCAGCGGTG GTTTGTTTGC CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG 
5161 CTTCAGCAGA G C G C AG ATAC CAAATACTGT CCTTCTAGTG TAG C C G TAGT TAGGCCACCA 
5221 CTTCAAGAAC TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC 
5281 TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA
5341 TAAGGCGCAG CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC
5401 GACCTACACC GAACTGAGAT ACCTACAGCG TGAGCATTGA GAAAGCGCCA CGCTTCCCGA
5461 AGGGAGAAAG GCGGACAGGT ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG
5521 GGAGCTTCCA GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG

55 81 ACTTGAG CGT CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAAC G C C AG 

56 41 CAACGCGGCC TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC 

57 01 TGCGTTATCC CCTGATTCTG TGGATAACCG TATTACCGCC TTTGAGTGAG CTGATACCGC 
5761 TCGCCGCAGC CGAACGACCG AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGAGCGCCC 
5 821 AATACG C AAA CCGCCTCTCC CCGCGtGTTG GCCGATTCAT TAATG C AG AG CTTG CAATTC 
5 881 GCGCGTTTTT CAATATTATT GAAGCATTTA TCAGGGTTAT TGTCTCATGA G CG G ATAC AT 

5 94 1 ATTTGAATGT ATTTAGAAAA ATAAACAAAT AGGGGTTCCG CGCACATTTC CCCGAAAAGT 

6 001 GCCACCTGAC GTCTAAGAAA CCATT 
Figure: pDEST8 Polyhedron Promoter, Baculovirus Transfer Plasmid

[Sequence panel: double-stranded nucleotide sequence of bases 1 to 204, spanning the polyhedrin promoter and the start of attR1. The panel is illegible in the source; the same region appears in the sequence listing below.]



Restriction map of pDEST8 (enzyme, position):

Not I 285
Pvu II 645
EcoR I 747
Nco I 1048
Mlu I 1194
BamH I 1200
BssH II 1241
Srf I 1615
Sma I 1616
Pst I 1869
BspM I 1871
Sal I 1875
Mun I 2123
Hpa I 2134
Bcl I 2272
Avr II 2287
NgoA IV 2773
Ban II 2803
BsaH I 3481
Pvu I 3650
Fsp I 3797
Sap I 5032
Sst II 5411
EcoR V 5469
BseR I 5592
Tth111 I 5871
Bbs I 6396
SnaB I 6525
pDEST8 6526 bp

Location (Base Nos.)    Gene Encoded
23..152                 Ppolh
284..160                attR1
534..1193               CmR
1313..1397              inactivated ccdA
1535..1840              ccdB
1881..2005              attR2
2766..3146              f1
3240..4090              ampR
4289..4869              ori
5564..6496              genR
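The feature table can be checked mechanically. The following is a minimal sketch, not part of the original disclosure, that computes the length of each listed pDEST8 feature and flags entries whose coordinates are listed high-to-low (attR1, 284..160). Reading a high-to-low entry as a feature on the opposite strand is an assumption; the source does not state the convention. The names `PDEST8_FEATURES`, `feature_length`, `listed_high_to_low` and `summarize` are hypothetical.

```python
# Feature table for pDEST8 as listed above: (name, first coordinate, second coordinate).
# Coordinates are 1-based and inclusive.
PDEST8_FEATURES = [
    ("Ppolh", 23, 152),
    ("attR1", 284, 160),
    ("CmR", 534, 1193),
    ("inactivated ccdA", 1313, 1397),
    ("ccdB", 1535, 1840),
    ("attR2", 1881, 2005),
    ("f1", 2766, 3146),
    ("ampR", 3240, 4090),
    ("ori", 4289, 4869),
    ("genR", 5564, 6496),
]


def feature_length(first, second):
    """Length in bases of an inclusive interval, whichever order it is listed in."""
    return abs(second - first) + 1


def listed_high_to_low(first, second):
    """True when the table gives the higher coordinate first.

    Assumption: such entries are taken to lie on the opposite strand and
    not to wrap around the plasmid origin.
    """
    return first > second


def summarize(features):
    """Return (name, length, listed_high_to_low) for every feature."""
    return [
        (name, feature_length(a, b), listed_high_to_low(a, b))
        for name, a, b in features
    ]


if __name__ == "__main__":
    for name, length, reversed_entry in summarize(PDEST8_FEATURES):
        note = " (listed high-to-low)" if reversed_entry else ""
        print(f"{name}: {length} bp{note}")
```

Under this reading attR1 and attR2 both come out at 125 bases, which is at least consistent with the two sites being of equal size.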



1 C GTATACT C C GGAATATTAA TAGATCATGG AGATAATTAA AATGATAACC ATCTCGCAAA 

61 TAAATAAGTA TTTTACTGTT TTCGTAACAG TTTTGTAATA AAAAAACCTA TAAATATTCC 

121 GGATTATTCA TACCGTCCCA CCATCGGGCG C G GAT CAT C A CAAGTTTGTA CAAAAAAGCT 

181 G AAC GAG AAA CGTAAAATGA TATAAATATC AATATATTAA ATTAGATTTT GCATAAAAAA 

241 CAGACTACAT AATACTGTAA AACACAACAT ATCCAGTCAC TATGGCGGCC GCTAAGTTGG 

3 01 CAGCATCACC CGACGCACTT TGCGCCGAAT AAATACCTGT GACGGAAGAT CACTTCGCAG 

3 61 AATAAATAAA TCCTGGTGTC CCTGTTGATA CCGGGAAGCC CTGGGCCAAC TTTTGGCGAA 

421 AATGAGACGT TGATCGGCAC GTAAGAGGTT CCAACTTTCA CCATAATGAA ATAAGATCAC 

481 TACCGGGCGT ATTTTTTGAG TTAT C GAG AT TTTCAGGAGC TAAGGAAGCT AAAATG GAGA 

541 AAAAAATCAC TGGATATACC ACCGTTGATA TATCCCAATG GCATCGTAAA GAACATTTTG 

6 01 AGGCATTTCA GTCAGTTGCT CAATGTACCT ATAACCAGAC CGTTCAGCTG GATATTACGG 

661 CCTTTTTAAA G AC CGTAAAG AAAAATAAGC ACAAGTTTTA TCCGGCCTTT ATTCACATTC 

721 TTGCCCGCCT GATGAATGCT CATCCGGAAT TCCGTATGGC AATGAAAGAC GGTGAGCTGG 

781 TGATATGGGA TAGTGTTCAC C CTTGTTAC A CCGTTTTCCA TGAGCAAACT GAAACGTTTT 

841 CATCGCTCTG GAG TG AATAC CACGACGATT TCCGGCAGTT TCTACACATA TATTCGCAAG 

901 ATGTGGCGTG TTACGGTGAA AACCTGGCCT ATTTCCCTAA AGGGTTTATT G AG AATATG T 

961 TTTTCGTCTC AGCCAATCCC TGGGTGAGTT TCACCAGTTT TGATTTAAAC GTGGCCAATA 

1021 TGGACAACTT CTTCGCCCCC GTTTTCACCA TGGGCAAATA TTATACGCAA GGCGACAAGG 

1081 TGCTGATGCC GCTGGCGATT CAGGTTCATC ATGCCGTCTG TGATGGCTTC CATGTCGGCA 

1141 GAATGCTTAA TGAATTACAA CAGTACTGCG ATGAGTGGCA GGGCGGGGCG TAAACGCGTG 

12 01 GATCCGGCTT ACTAAAAGCC AGATAACAGT ATGCGTATTT GCGCGCTGAT TTTTGCGGTA 

12 61 TAAGAATATA TACTGATATG TATACCCGAA GTATGTCAAA AAGAGGTGTG C TATGAAGC A 
1321 GCGTATTACA GTGACAGTTG ACAGCGACAG CTATCAGTTG CTCAAGGCAT ATATGATGTC 

13 81 AATATCTCCG GTCTGGTAAG CACAACCATG CAGAATGAAG CCCGTCGTCT GCGTGCCGAA 
1441 CGCTGGAAAG CGGAAAATCA GGAAGGGATG GCTGAGGTCG CCCGGTTTAT TGAAATGAAC 
15 01 GGCTCTTTTG CTGACGAGAA CAGGGACTGG TGAAATGCAG TTTAAGGTTT ACAC CTATAA 

15 61 AAG AG AG AG C CGTTATCGTC TGTTTGTGGA T G T AC AG AG T GATATTATTG ACACGCCCGG 
1621 GCGACGGATG GTGATCCCCC TGGCCAGTGC ACGTCTGCTG TCAGATAAAG TCTCCCGTGA 

1681 ACTTTACCCG GTGGTGCATA TCGGGGATGA AAGCTGGCGC ATGATGACCA CCGATATGGC
1741 CAGTGTGCCG GTCTCCGTTA TCGGGGAAGA AGTGGCTGAT CTCAGCCACC GCGAAAATGA

1801 CATCAAAAAC GCCATTAACC TGATGTTCTG GGGAATATAA ATGTCAGGCT CCCTTATACA

1861 CAGCCAGTCT GCAGGTCGAC CATAGTGACT GGATATGTTG TGTTTTACAG TATTATGTAG

1921 TCTGTTTTTT ATG CAAAATC TAATTTAATA TATTGATATT TAT AT C ATTT TACGTTTCTC 

1981 GTTCAGCTTT CTTGTACAAA GTGGTGATAG CTTGTCGAGA AGTAC TAG AG GATCATAATC 

2041 AGCCATACCA CATTTGTAGA GGTTTTACTT GCTTTAAAAA ACCTCCCACA CCTCCCCCTG

2101 AACCTGAAAC ATAAAATGAA TGCAATTGTT GTTGTTAACT TGTTTATTGC AG CTTATAAT 

2161 GGTTACAAAT AAAGCAATAG CATCACAAAT TTCACAAATA AAG C ATTTTT TTCACTGCAT 

2221 TCTAGTTGTG GTTTGTCCAA ACTCATCAAT GTAT C TTAT C ATGTCTGGAT CTGATCACTG 

2281 CTTGAGCCTA GGAGATCCGA ACCAGATAAG TGAAATCTAG TTCCAAACTA TTTTGTCATT

2 341 TTTAATTTTC GTATTAGCTT ACGACG CTAC ACCCAGTTCC CAT CTATTTT GTCACTCTTC 

24 01 CCTAAATAAT CCTTAAAAAC TCCATTTCCA CCCCTCCCAG TTCCCAACTA TTTTGTCCGC 

2461 CCACAGCGGG GC ATTTTT CT TCCTGTTATG TTTTTAATCA AACATCCTGC CAACTCCATG 

2521 TGACAAACCG TCATCTTCGG CTACTTTTTC TCTGTCACAG AATGAAAATT TTTCTGTCAT
2581 CTCTTCGTTA TTAATGTTTG TAATTGACTG AATATCAACG CTTATTTGCA GCCTGAATGG
2 641 CGAATGGACG CGCCCTGTAG CGGCGCATTA AGCGCGGCGG GTGTGGTGGT TACGCGCAGC 

27 01 GTGACCGCTA CACTTGCCAG CGCCCTAGCG CCCGCTCCTT TCGCTTTCTT CCCTTCCTTT 
2761 CTCGCCACGT TCGCCGGCTT TCCCCGTCAA GCTCTAAATC GGGGGCTCCC TTTAGGGTTC 

28 21 CGATTTAGTG CTTTACGGCA CCTCGACCCC AAAAAACTTG ATTAGGGTGA TGGTTCACGT 
28 81 AGTGGGCCAT CGCCCTGATA GACGGTTTTT CGCCCTTTGA CGTTGGAGTC CACGTTCTTT 

2 941 AATAGTGGAC TCTTGTTCCA AACTGGAACA ACACTCAACC CTATCTCGGT CTATTCTTTT 

3 0 01 GATTTATAAG GGATTTTGCC GATTTCGGCC TATTGGTTAA AAAATG AG C T GATTTAACAA 
3 061 AAATTTAACG CGAATTTTAA CAAAATATTA ACGTTTACAA TTTCAGGTGG CACTTTTCGG 
3121 GG AAATGTG C GCGGAACCCC TATTTGTTTA TTTTT CTAAA T AC ATT C AAA TATGTATCCG 
3181 CTCATGAGAC AATAACCCTG ATAAATG CTT CAATAATATT GAAAAAGGAA GAGTATGAGT 
3241 ATTCAACATT TCCGTGTCGC CCTTATTCCC TTTTTTGCGG CATTTTGCCT TCCTGTTTTT 
3 3 01 GCTCACCCAG AAACG CTGGT GAAAGTAAAA GATGCTGAAG ATCAGTTGGG TGCACGAGTG 
3 361 GGTTACATCG AACTGGATCT CAACAGCGGT AAGATCCTTG AGAGTTTTCG CCCCGAAGAA 
3 421 CGTTTTCCAA TGATGAGCAC TTTTAAAGTT CTGCTATGTG GCGCGGTATT ATCCCGTATT 
3 481 GACGCCGGGC AAGAGCAACT CGGTCGCCGC ATACACTATT CTCAGAATGA CTTGGTTGAG 
3 541 TACT C AC C AG T C AC AG AAAA GCATCTTACG GATGGCATGA CAGTAAGAGA ATTATGCAGT 
36 01 GCTGCCATAA C CAT G AGTG A TAACACTGCG GCCAACTTAC TTCTGACAAC GATCGGAGGA 
3661 CCGAAGGAGC TAACCGCTTT TTTGCACAAC ATGGGGGATC ATGTAACTCG CCTTGATCGT 
3721 TGGGAAC CGG AG CTG AATGA AGCCATACCA AACGACGAGC GTG AC AC C AC GATGCCTGTA 
3 781 GCAATGGCAA CAACGTTGCG CAAACTATTA ACTGGCGAAC T AC TT AC T C T AGCTTCCCGG 
3 841 CAACAATTAA TAGACTGGAT GGAGGCGGAT AAAGTTG C AG GACCACTTCT GCGCTCGGCC 
3 901 CTTCCGGCTG GCTGGTTTAT TGCTGATAAA TCTGGAGCCG GTGAGCGTGG GTCTCGCGGT 

3 961 ATCATTG CAG CACTGGGGCC AGATGGTAAG CCCTCCCGTA TCGTAGTTAT CTACACGACG 

4 021 GGGAGTCAGG CAACTATGGA TGAACGAAAT AG AC AG AT C G CTGAGATAGG TGCCTCACTG 
4 081 ATTAAGCATT GGTAACTGTC AG AC C AAGTT TACTCATATA TACTTTAGAT TGATTTAAAA 
4141 CTTCATTTTT AATTTAAAAG GATCTAGGTG AAGATCCTTT TTGATAATCT C ATG AC C AAA 
42 01 ATCCCTTAAC GTGAGTTTTC GTTCCACTGA GCGTCAGACC C C GT AG AAAA GATCAAAGGA 
4261 TCTTCTTGAG ATC CTTTTTT TCTGCGCGTA ATCTGCTGCT TGCAAACAAA AAAACCACCG 
4 321 CTACCAGCGG TGGTTTGTTT GCCGGATCAA GAGCTAC CAA CTCTTTTTCC GAAGGTAACT 
4381 GG CTT CAG C A GAGCGCAGAT ACCAAATACT GTCCTTCTAG TGTAGC CGTA GTTAGGCCAC 
4 441 CACTTCAAGA ACTCTGTAGC ACCGCCTACA TACCTCGCTC TGCTAATCCT GTTACCAGTG 
4 5 01 GCTGCTGCCA GTGGCGATAA GTCGTGTCTT ACCGGGTTGG ACTCAAGACG AT AGTT AC C G 
4 561 GATAAGGCGC AGCGGTCGGG CTGAACGGGG GGTTCGTGCA CACAGCCCAG CTTGGAGCGA 
4 621 AC G AC C T AC A CCGAACTGAG ATAC CTACAG CGTGAGCATT GAGAAAGCGC CACGCTTCCC 
4 681 GAAGGGAGAA AGGCGGACAG GTATCCGGTA AGCGGCAGGG TCGGAACAGG AG AG C G C AC G 
4 741 AGGG AG C TT C CAGGGGGAAA CGC CTGGT AT CTTTATAGTC CTGTCGGGTT TCGCCACCTC 
48 01 TGACTTGAGC GTCGATTTTT GTGATGCTCG TCAGGGGGGC GGAGCCTATG GAAAAACGCC 
4 861 AGCAACGCGG CCTTTTTACG GTTCCTGGCC TTTTGCTGGC CTTTTGCTCA CATGTTCTTT 
4921 CCTGCGTTAT CCCCTGATTC TGTGGATAAC CGTATTACCG CCTTTGAGTG AGCTGATACC

4 981 GCTCGCCGCA GCCGAACGAC CGAGCGCAGC GAGTCAGTGA GCGAGGAAGC GGAAG AG CGC 

5 041 CTGATGCGGT ATTTTCTCCT TACGCATCTG TGCGGTATTT CACACCGCAG ACCAGCCGCG 
5101 TAACCTGGCA AAATCGGTTA CGGTTGAGTA ATAAATG GAT GCCCTGCGTA AGCGGGTGTG 
5161 GGCGGACAAT AAAGTCTTAA ACTGAACAAA ATAGATCTAA ACT ATG AC AA TAAAGTCTTA 
5221 AACTAGACAG AATAGTTGTA AACTGAAATC AGTCCAGTTA TG CTGTG AAA AAG CAT AC TG 
5281 GACTTTTGTT ATGGCTAAAG CAAACTCTTC ATTTTCTGAA GTG C AAATTG CCCGTCGTAT 
5341 TAAAGAGGGG CGTGGCCAAG GGCATGGTAA AGACTATATT CGCGGCGTTG TGACAATTTA 
54 01 CCGAACAACT CCGCGGCCGG G AAG C C GAT C TCGGCTTGAA CGAATTGTTA GGTGGCGGTA 
5461 CTTGGGTCGA TATCAAAGTG CATCACTTCT TCCCGTATGC CCAACTTTGT ATAGAGAGCC
5 521 ACTGCGGGAT CGTCACCGTA ATCTGCTTGC AC G TAG AT C A C AT AAG C AC C AAGCGCGTTG 
5581 GCCTCATGCT TGAGGAGATT GATGAGCGCG GTGGCAATGC CCTGCCTCCG GTGCTCGCCG 
5641 GAG AC T G C G A GATCATAGAT ATAGATCTCA CTACGCGGCT GCTCAAACCT GGGCAGAACG 
57 01 TAAGCCGCGA GAG CGC C AAC AACCGCTTCT TGGTCGAAGG CAGCAAGCGC GATGAATGTC 

57 61 TTACT AC G G A GCAAGTTCCC GAGGTAATCG GAGTCCGGCT GATGTTGGGA GTAGGTGGCT 
5821 ACGTCTCCGA ACTCACGACC GAAAAGATCA AG AG CAG C C C G C ATG G ATT T GACTTGGTCA 

58 81 GGGCCGAGCC TACATGTGCG AATGATGCCC ATACTTGAGC C AC CTAACTT TGTTTTAGGG 

5 941 CGACTGCCCT GCTGCGTAAC ATCGTTGCTG CTGCGTAACA TCGTTGCTGC T C CAT AAC AT 

6001 CAAACATCGA CCCACGGCGT AACGCGCTTG CTGCTTGGAT GCCCGAGGCA TAGACTGTAC
6061 AAAAAAACAG TCATAACAAG CCATGAAAAC CGCCACTGCG CCGTTACCAC CGCTGCGTTC
6121 GGTCAAGGTT CTGGACCAGT TGCGTGAGCG CATACGCTAC TTGCATTACA GTTTACGAAC
6181 CGAACAGGCT TATGTCAACT GGGTTCGTGC CTTCATCCGT TTCCACGGTG TGCGTCACCC
6241 GGCAACCTTG GGCAGCAGCG AAGTCGAGGC ATTTCTGTCC TGGCTGGCGA ACGAGCGCAA
6301 GGTTTCGGTC TCCACGCATC GTCAGGCATT GGCGGCCTTG CTGTTCTTCT ACGGCAAGGT
6361 GCTGTGCACG GATCTGCCCT GGCTTCAGGA GATCGGAAGA CCTCGGCCGT CGCGGCGCTT
6421 GCCGGTGGTG CTGACCCCGG ATGAAGTGGT TCGCATCCTC GGTTTTCTGG AAGGCGAGCA
6481 TCGTTTGTTC GCCCAGGACT CTAGCTATAG TTCTAGTGGT TGGCTA
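Positions on the pDEST8 restriction map can be cross-checked against the sequence listing. The following is a minimal sketch, not part of the original disclosure, that searches a short stretch of the listing (the row beginning at base 241) for the Not I recognition sequence GCGGCCGC and reports its 1-based start position. It assumes that map positions refer to the first base of the recognition sequence, which the source does not state, and it treats the fragment as linear, so a site spanning the plasmid origin would be missed. The names `FRAGMENT`, `FRAGMENT_START` and `find_sites` are hypothetical.

```python
# Row 241 of the pDEST8 listing (bases 241 to 300), copied from the sequence above.
FRAGMENT_START = 241
FRAGMENT = (
    "CAGACTACAT" "AATACTGTAA" "AACACAACAT"
    "ATCCAGTCAC" "TATGGCGGCC" "GCTAAGTTGG"
)

NOT_I = "GCGGCCGC"  # Not I recognition sequence


def find_sites(seq, site, first_position=1):
    """Return the 1-based start position of every occurrence of `site` in `seq`.

    `first_position` is the plasmid coordinate of seq[0]. Overlapping
    occurrences are reported; the search does not wrap around the origin.
    """
    positions = []
    index = seq.find(site)
    while index != -1:
        positions.append(index + first_position)
        index = seq.find(site, index + 1)
    return positions


if __name__ == "__main__":
    print(find_sites(FRAGMENT, NOT_I, FRAGMENT_START))
```

For this fragment the search returns a single hit at base 285, which agrees with the position marked for that enzyme on the map.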
Figure: pDEST9 Semliki Forest Virus vector

[Sequence panel: double-stranded nucleotide sequence of bases 103 to 255. The panel is illegible in the source; the same region appears in the sequence listing below.]
pDEST9 12464 bp

Location (Base Nos.)    Gene Encoded
355..232                attR1
605..1264               CmR
1384..1468              inactivated ccdA
1606..1911              ccdB
1952..2078              attR2
2532..2782              ori
3482..4282              ampR
5232..5365              SP6 promoter
5365..6965              nsP1: non-structural protein 1
6965..9265              nsP2: non-structural protein 2
9265..10865             nsP3: non-structural protein 3
10865..161              nsP4: non-structural protein 4
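The last entry, nsP4 at 10865..161, starts near the end of the 12464 bp plasmid and ends near its beginning. The following is a minimal sketch, not part of the original disclosure, of how such an entry could be handled if it is read as an interval that wraps around the origin of a circular molecule. That reading is an assumption based on nsP4 following directly after nsP3 (9265..10865); the source does not state it. The function names `circular_length` and `circular_slice` are hypothetical.

```python
PDEST9_SIZE = 12464  # plasmid size in bp, from the table heading


def circular_length(start, end, plasmid_size):
    """Length of the 1-based inclusive interval start..end on a circular molecule.

    When start > end the interval is assumed to run through the origin:
    start..plasmid_size followed by 1..end.
    """
    if start <= end:
        return end - start + 1
    return (plasmid_size - start + 1) + end


def circular_slice(seq, start, end):
    """Extract start..end (1-based, inclusive) from a circular sequence string."""
    if start <= end:
        return seq[start - 1:end]
    # Wrap-around: tail of the sequence followed by its head.
    return seq[start - 1:] + seq[:end]


if __name__ == "__main__":
    print(circular_length(10865, 161, PDEST9_SIZE))
    # Toy 10-base circle to show the wrap-around behaviour.
    print(circular_slice("ABCDEFGHIJ", 9, 2))
```

Note that the same high-to-low notation is used for attR1 (355..232), where a wrap-around reading would make the site span almost the whole plasmid; the two cases therefore have to be told apart from context.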



1 AGCAAGTGGT TCCGGACAGG CTTGGGGGCC GAACTGGAGG TGGCACTAAC ATCTAGGTAT 
61 GAGGTAGAGG GCTGCAAAAG TATCCTCATA GCCATGGCCA CCTTGGCGAG GGACATTAAG 
121 GCGTTTAAGA AATTGAGAGG AC CTGTTATA CACCTCTACG GCGGTCCTAG ATTGGTG CGT 
181 TAATACACAG AATT CTGATT GGATCCCGGT CCGAAGCGCG CTTTCCCATC ACAAGTTTGT 
241 ACAAAAAAGC TGAACGAGAA AC GTAAAATG ATATAAATAT CAATATATTA AATTAGATTT 
3 01 TGCATAAAAA ACAGACTACA TAATACTGTA AAACACAACA TATCCAGTCA CTATGGCGGC 
361 CGCTAAGTTG G C AG CAT C AC CCGACGCACT TTGCGCCGAA TAAATAC CTG TGACGGAAGA 
421 TCACTTCGCA GAATAAATAA ATCCTGGTGT CCCTGTTGAT ACCGGGAAGC CCTGGGCCAA 
481 CTTTTGGCGA AAATGAGACG TTGATCGGCA CGTAAGAGGT TCCAACTTTC AC C AT AATG A 
541 AATAAGATCA CTACCGGGCG TATTTTTTGA GTTATCGAGA TTTT C AG GAG CTAAGGAAGC 
6 01 TAAAATG GAG AAAAAAATCA CTGGATATAC C AC C GTTG AT ATATCCCAAT GGCATCGTAA 
661 AGAACATTTT GAGG CATTTC AGTCAGTTGC T C AATGTAC C TATAACCAGA CCGTTCAGCT 
721 GGATATTACG GCCTTTTTAA AGAC CGTAAA GAAAAATAAG CACAAGTTTT ATCCGGCCTT 
781 TATTCACATT CTTGCCCGCC TGATG AATG C TCATCCGGAA TTCCGTATGG CAATGAAAGA 
841 CGGTGAGCTG GTGATATGGG ATAGTGTTCA CCCTTGTTAC ACCGTTTTCC ATGAGCAAAC 
901 TGAAACGTTT TCATCGCTCT GGAGTGAATA CCACGACGAT TTCCGGCAGT TTCTACACAT 
961 ATATTCGCAA GATGTGGCGT GTTACGGTGA AAACCTGGCC TATTTCCCTA AAGGGTTTAT 
1021 TGAGAATATG TTTTTCGTCT CAGCCAATCC CTGGGTGAGT TTCACCAGTT TTGATTTAAA 
1081 CGTGGCCAAT ATGGACAACT TCTTCGCCCC CGTTTTCACC ATGGGCAAAT ATTATACGCA 
1141 AGGCGACAAG GTGCTGATGC CGCTGGCGAT TCAGGTTCAT CATGCCGTCT GTGATGGCTT 
12 01 CCATGTCGGC AG AATG CTTA ATGAATTACA ACAGTACTGC GATGAGTGGC AGGGCGGGGC 

12 61 GTAAAGATCT GGATCCGGCT TACTAAAAGC CAGATAACAG TATGCGTATT TGCGCGCTGA 
1321 TTTTTGCGGT ATAAGAATAT ATACTGATAT GTATACCCGA AGTATGTCAA AAAGAGGTGT 

13 81 GCTATGAAGC AGCGTATTAC AGTGACAGTT GACAGCGACA G CTATCAGTT GCTCAAGGCA 
1441 TATATGATGT CAATATCTCC GGTCTGGTAA G C AC AAC CAT G CAGAATG AA GCCCGTCGTC 
15 01 TGCGTGCCGA ACGCTGGAAA GCGGAAAATC AGGAAGGGAT GGCTGAGGTC GCCCGGTTTA 
1561 TTGAAATGAA CGGCTCTTTT GCTGACGAGA ACAGGGACTG GTGAAATGCA GTTTAAGGTT 
1621 TACACCTATA AAAGAGAGAG CCGTTATCGT CTGTTTGTGG ATGTACAGAG TGATATTATT 

1681 GACACGCCCG GGCGACGGAT GGTGATCCCC CTGGCCAGTG CACGTCTGCT GTCAGATAAA
1741 GTCTCCCGTG AACTTTACCC GGTGGTGCAT ATCGGGGATG AAAGCTGGCG CATGATGACC 
18 01 ACCGATATGG CCAGTGTGCC GGTCTCCGTT ATCGGGGAAG AAGTGGCTGA TCTCAGCCAC 
18 61 CGCGAAAATG ACATCAAAAA CGCCATTAAC CTGATGTTCT GGGGAATATA AATGTCAGGC 
1921 TCCCTTATAC ACAGCCAGTC TGCAGGTCGA CCATAGTGAC TGGATATGTT GTGTTTTACA
1981 GTATTATGTA GTCTGTTTTT TATGCAAAAG TGCTAATTTA ATATATTGAT ATT TAT AT C A 
2 041 TTTTAC GTTT CTCGTTCAGC TTTCTTGTAC AAAGTGGTGA TGGGAACTCG AGTTCACTAG 
2101 TCGATCCCGC GGCCGCTTTC GAACCTAGGC AAGCATGCGG GCCCAGTGGG TAATTAATTG 
2161 AATT AC AT C C CTACG CAAAC GTTTTACGGC CGCCGGTGGC GCCCGCGCCC GGCGGCCCGT 
2221 CCTTGGCCGT TGCAGGCCAC TCCGGTGGCT CCCGTCGTCC CCGACTTCCA GGCCCAGCAG 
22 81 ATGCAGCAAC TCATCAGCGC CGTAAATGCG CTGACAATGA GACAGAACGC AATTGCTCCT 
2341 GCTAGGAGCT TAATTCGACG AATAATTGGA TTTTTATTTT ATTTTGCAAT TGGTTTTTAA 
2401 TATTTCCAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA
2461 AAAAAAAAAA AAAAAAACTA GAAATCGCGA TTTCTAGTCT GCATTAATGA ATCGGCCAAC

2 521 GCGCGGGGAG AGG CGGTTTG CGTATTGGGC GCTCTTCCGC TTCCTCGCTC ACTGACTCGC 

2 581 TGCGCTCGGT CGTTCGGCTG CGGCGAGCGG TAT C AG C T C A CTCAAAGGCG GTAATACGGT 

2 641 TATCCACAGA ATCAGGGGAT AACGCAGGAA AGAACATGTG AGCAAAAGGC CAGCAAAAGG 

2 701 CCAGGAACCG TAAAAAGGCC GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG 

2 761 AG CATCACAA AAATCGACGC TCAAGTCAGA GGTGGCGAAA CCCGACAGGA CTATAAAGAT 

2 821 ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC TGTTCCGACC CTGCCGCTTA 

2 881 CCGGATACCT GTCCGCCTTT CTCCCTTCGG GAAGCGTGGC GCTTTCTCAA TGCTCGCGCT 

2 941 GTAGGTATCT CAGTTCGGTG TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC 

3 001 CCGTTCAGCC CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA 
3 061 GACACGACTT ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA GCGAGGTATG 
3121 TAGGCGGTGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA CGGCTACACT AGAAGGACAG 
3181 TATTTGGTAT CTGCGCTCTG CTGAAGCCAG TTACCTTCGG AAAAAGAGTT GGTAGCTCTT 
3241 GATCCGGCAA ACAAACCACC GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA 

33 01 CGCGCAGAAA AAAAGGATCT CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC 
3361 AGTGGAACGA AAACTCACGT TAAGGGATTT TGGTCATGAG ATT AT C AAAA AGGATCTTCA 
3421 CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT CTAAAGTATA TATGAGTAAA 

34 81 CTTGGTCTGA CAGTTACCAA TGCTTAATCA GTGAGGCACC TATCTCAGCG ATCTGTCTAT 
3 541 TTCGTTCATC CATAGTTGCC TGACTCCCCG TCGTGTAGAT AACTACGATA CGGGAGGGCT 
3 6 01 TAC CATCTGG CCCCAGTGCT GCAATGATAC CGCGAGACCC ACGCTCACCG G C T C C AG ATT 
3 661 TATCAGCAAT AAACCAGCCA GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT GCAACTTTAT 
3721 CCGCCTCCAT CCAGTCTATT AATTGTTGCC GGGAAGCTAG AGTAAGTAGT TCGCCAGTTA 
3781 ATAGTTTGCG CAACGTTGTT GCCATTGCTA CAGGCATCGT GGTGTCACGC TCGTCGTTTG 
3 841 GTATGGCTTC ATTCAGCTCC GGTTCCCAAC GATCAAGGCG AGTTACATGA TCCCCCATGT 
3 901 TGTGCAAAAA AGCGGTTAGC TCCTTCGGTC CTC CGATCGT TGTCAGAAGT AAGTTGGCCG 

3 961 CAGTGTTATC ACTCATGGTT ATGGCAG CAC TGCATAATTC TCTTACTGTC ATGCCATCCG 

4 021 TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC ATTCTGAGAA TAGTGTATGC 
4 081 GGCGACCGAG TTGCTCTTGC CCGGCGTCAA TACGGGATAA TACCGCGCCA CAT AG C AG AA 
4141 CTTTAAAAGT GCTCATCATT GGAAAACGTT CTTCGGGGCG AAAACTCTCA AGGATCTTAC 
42 01 CGCTGTTGAG ATCCAGTTCG ATGTAAC C C A CTCGTGCACC CAACTGATCT TCAGCATCTT 

42 61 TTACTTTCAC CAGCGTTTCT GGGTG AG C AA AAACAGGAAG GCAAAATGCC GCAAAAAAGG 
4321 GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT C CTTTTTC AA TATTATTGAA 

43 81 GCATTTATCA GGGTTATTGT CTCATGAGCG GATACATATT TGAATGTATT TAGAAAAATA 
4441 AACAAATAGG GGTTCCGCGC ACATTTCCCC GAAAAGTGCC ACCTGACGTC TAAGAAACCA 
45 01 TTATTATCAT G AC ATTAAC C TATAAAAATA GGCGTATCAC GAGGCCCTTT CGTCTCGCGC 
4 561 GTTTCGGTGA TGACGGTGAA AACCTCTGAC ACATGCAGCT CCCGGAGACG GTCACAGCTT 
4 621 CTGTCTAAGC GGATGCCGGG AGCAGACAAG CCCGTCAGGG CGCGTCAGCG GGTGTTGGCG 
4681 GGTGTCGGGG CTGGCTTAAC TATGCGGCAT CAGAGCAGAT TGTACTGAGA GTGCAC CATA 
4 741 TCGACGCTCT CCCTTATGCG ACTCCTGCAT TAGGAAGCAG C C CAGTACTA GGTTGAGGCC 
4801 GTTGAGCACC GCCGCCGCAA GGAATGGTG C ATGCAAGGAG ATGGCGCCCA ACAGTCCCCC 
4861 GGCCACGGGG CCTGCCACCA TACCCACGCC GAAACAAGCG CTCATGAGCC CGAAGTGGCG 
4 921 AGCCCGATCT TCCCCATCGG TGATGTCGGC GATATAGGCG CCAGCAACCG CACCTGTGGC 

4 981 GCCGGTGATG CCGGCCACGA TGCGTCCGGC GTAGAGGATC TGGCTAGCGA TGACCCTGCT 

5 041 GATTGGTTCG CTGAC CATTT CCGGGGTGCG GAACGGCGTT ACCAGAAACT CAGAAGGTTC 
5101 GTCCAACCAA ACCGACTCTG ACGGCAGTTT ACGAGAGAGA TGATAGGGTC TGCTTCAGTA

5161 AGCCAGATGC TACACAATTA GGCTTGTACA TATTGTCGTT AGAACGCGGC TACAATTAAT 

5221 AC ATAAC CTT ATGTATCATA CACATACGAT TTAGGTGACA CTATAGATGG CGGATG TGTG 

52 81 ACATACACGA CGCCAAAAGA TTTTGTTCCA GCTCCTGCCA CCTCCGCTAC GCGAGAGATT 

5341 AACCACCCAC GATGGCCGCC AAAGTGCATG TTGATATTGA GGCTGACAGC CCATTCATCA

54 01 AGTCTTTGCA GAAGG CATTT CCGTCGTTCG AGGTGGAGTC ATTGCAGGTC AC AC C AAATG 

54 61 AC CATG CAAA TG C C AG AG C A TTTTCG CAC C TGGCTACCAA ATTGATCGAG CAGGAGACTG 

5 521 ACAAAGACAC ACTCATCTTG GATATCGGCA GTGCGCCTTC CAGGAGAATG ATGTCTACGC 

5 581 ACAAATACCA CTGCGTATGC CCTATGCGCA G CG C AG AAG A CCCCGAAAGG CTCGATAGCT 

5641 ACG CAAAGAA ACTGGCAGCG GCCTCCGGGA AGGTGCTGGA TAG AG AG AT C GCAGGAAAAA 

57 01 TCACCGACCT GCAGACCGTC ATGGCTACGC CAGACGCTGA ATCTCCTACC TTTTGCCTGC 

57 61 ATACAGACGT CACGTGTCGT ACGGCAGCCG AAGTGGCCGT AT AC C AGG AC GTGTATGCTG 

5821 TACATGCACC AACATCGCTG TACCATCAGG CGATGAAAGG TGTCAGAACG GCGTATTGGA 

5881 TTGGGTTTGA CACCACCCCG TTTATGTTTG ACGCGCTAGC AGGCGCGTAT CCAACCTACG
5941 CCACAAACTG GGCCGACGAG CAGGTGTTAC

6 001 CCTTGACTGA GGGAAGACTC GGCAAACTGT 
6 061 GCGACACAGT CATGTTCTCG GTAGGATCTA 
6121 GGAGCTGGCA CTTACCCTCC GTATTCCACC 
6181 GCGATAC CAT CGTATCATGT GAAGGGTACG 
6241 TGTACGGTAA AACGGTAGGG TACGCCGTGA 
63 01 AGACCACAGA CACTGTCAAA GGAGAAAGAG 

63 61 CAACCATCTG TGATCAAATG ACTGGCATAC 
6421 AGAAGTTGTT AGTGGGATTG AATCAGAGGA 

64 81 CTAACACGAT GAAGAACTAT CTGCTTCCGA 
6 541 GGGAATACAA GG C AG AC C TT GATGATGAAA 
66 01 CTTGCTGCTG CTTGTGGGCA TTTAAAACGA 
6661 ACACCCAGAC AATAGTGAAG GTGCCTTCAG 
6 721 GGTCTACAGG CCTCGCAATC CCAGTCAGAT 
6 781 CCAAGCGAGA GTTAATACCT GTTCTCGACG 
6 841 AGAAGGAGAG GTTGGAGGCC GAGCTGACTA 
6 901 CGCCGGCGGA GACGGGAGTC GTCGACGTCG 

6 961 CAGGGGTCGT GGAAAC AC CT CGCAGCGCGT 

7 021 TACTAGGAAA TTACGTAGTT CTGTCCCCGC 
7 081 CCGTGCACCC TCTAG CAGAG CAGGTGAAAA 
7141 ACCAGGTCGA CGGATATGAC GGCAGGGTCC 

72 01 CTGAGTTTCA GGCTTTGAGC GAGAGCGCCA 
7261 ACAGGAAACT ATACCATATT GCCGTTCACG 
7321 ACGAGAAAGT CAGAG CTGAA AGAACTGACG 

73 81 GCTGCGTCAA GAGAGAGGAA GCGTCGGGTT 
7441 CGTTCCATGA ATTCGCCTAC GAAGGGCTGA 

75 01 CAGTAGTAGG AGTCTTTGGG GTTCCGGGAT 
7561 TGACCAAACA CGATCTGGTC ACCAGCGGCA 
7621 ACGTGAAGAA GCACCGCGGG AAGGGGACAA 

76 81 ACGGGTGTCG TCGTGCCGTG G AC AT C CTAT 
7741 GTACTCTGCT GGCCCTAATT GCTCTTGTTA 
78 01 ACCCCAAGCA ATGCGGATTC TTCAATATGA 
78 61 TCTGCACTGA AGTATGTCAT AAAAGTATAT 

7 921 TCGTGTCTAC GTTGCACTAC GGAGG CAAGA 
7981 T AAT CAT AG A C AC C AC AGGA CAGACCAAGC 

8 041 TCCGAGGCTG GGCAAAGCAG CTGCAGTTGG 
8101 CAGCATCTCA GGGCCTCACC CGCAAAGGGG 
8161 ATCCCTTGTA TGCCCCTGCG TCGGAGCACG 
8221 GGCTGGTGTG GAAAACGCTG GCCGGCGATC 
82 81 AGGGTAACTT TACGGCCACA TTGGAAGAAT 
8 341 TGATTGAAGG ACCGGCTGCG CCTGTGGACG 
84 01 CGAAAAGCCT GGTGCCTGTC CTGGACACTG 
84 61 GCACCATAAT TAC AG CATTT AAGGAGGACA 
8 521 AAATTTG C AC CAAGTACTAT GGAGTTGACC 
8581 TGTCCCTGTA TTACGAGAAC AACCACTGGG
8641 TCAATGCCGC AACAGCTGCC AGGCTGGAAG

87 01 ATACGGGCAA GCAGGCAGTT ATC G C AG AAA 
8761 ATGTAATTCC TATCAAC CGC AGGCTGCCGC 
8821 AAGGCAGTAG GGTTGAGTGG CTGGTCAATA

88 81 GTGAGTACAA CCTGGCTTTG CCTCGACGCA 
8 941 CAGGCGCCGA TAGGTGCTAC GACCTAAGTT 
9001 ACTTGGTCTT TGTGAACATT CACACGGAAT 
9061 ACCACGCCAT GAAGCTGCAG ATG CTTGGGG 
9121 G C ATCTTG AT GAGAGCTTAC GGATACGCCG 
9181 TAAGCAGAAA GTTCTCGTCT GCAAGAGTGT 

92 41 AAGTGTTCTT GCTGTTCTCC AACTTTGACA 

93 01 TGAATACCAA GCTGAGTGCC GTGTATGCCG 
93 61 CATCCTACAG AGTTAAGAGA GCAGACATAG 
CAAGAGCTCC


AAGTTG 
TAACGGGAGG
TACTACCATG


TGGATCGGCC 


ATTCCG 


GTCC 


CTATGGTGTA 


CAACGAAAGG 
GTCA
GAACACCGAC


GAGGAG. 


AACT 


CCGAGTACGT 
AAAT
ATCCTG
TGCGCACGAC


CAACCCGTGC 


AACAAA 


CCCA 


CCAAGCCAGG 


AGACATCGTG 


TTAACA 


TGCT 


ACTACCGTGG 


ACACGAAGTC 


ATGACA 


3CAG 


TATACGCCGT 


AAGGCAGAAG 


GTGAAT 


3 AAA 


TGAATGTACT 


GCTGACGCGC 


ACTGAG 


3ATA 


CCTGGATTAA 


GGTCCTATCA 


AAC ATT 


CCAC 


GGCAAGAAGA 


ACACGACAAA 


ATAATG. 


AAGG 


CGTTCCAGAA 


CAAAGCGAAC 


GTGTGT 


TGGG 


C CGGAATC AG 


ATTGACAGCA 


GAGGAG 


TGGA 


GAGCTTACTC 


TCCAGTGGTG 


GCCTTG. 


AATG 


TGGACAGTGG 


CCTGTTTTCT 


GCCCCG. 


AAGG 


ATAAC AG AC C 


TGGTGGAAGG 


ATGTAT 


3GAT 


CTAGACATAC 


CTTCCTGAAG 


GGGCAG" 


TGGC 


GAAAAATCCA 


ACCGCTTTCT 


GTGCTG 


3ACA 


ACGCCCTGGT 


GGCTGAGTAC 


AAG AC G 


GTTA 


AAGTAAGAGG 


GTACCACGTC 


CTGCTG 


3TGA 


GGGTCACTTG 


GTTGTCACCG 


CTGAAT 


GTCA 


TAGGACTGCC 


GGCTGACGCC 


GGCAGG' 


TTCG 


TCAGAATCCA 


CCACTACCAG 


CAGTGT 


3TCG 


GAGATGCGCT 


ACGACTGCTA 


AAACCC 


3GCG 


ATAAAATCAG 


CGAAGCCGTT 


GTTTCC' 


TCCT 


TGCGCCCGGA 


TTGTGTCACC 


AG C AAT. 


AC AG 


AC G G AAAGAG 


ACCCTCTACG 


CTACAC 


CAGA 


G AG AAG C CAT 


GCACACGGCC 


GGGTGT 


GCAC 


CCACGTGCAC 


AGAAGCGGCT 


GTGGTT. 


AACG - 
9421 CAGCTAACGC CCGTGGAACT GTAGGGGATG GCGTATGCAG GGCCGTGGCG AAGAAATGGC
9481 CGTCAGCCTT TAAGGGAGCA GCAACACCAG TGGGCACAAT TAAAACAGT Z ATGTGCGGCT
9541 CGTACCCCGT CATCCACGCT GTAGCGCCTA ATTTCTCTGC CACGACTGAA GCGGAAGGGG
9601 ACCGCGAATT GGCCGCTGTC TACCGGGCAG TGGCCGCCGA AGTAAACAGA CTGTCACTGA
9661 GCAGCGTAGC CATCCCGCTG CTGTCCACAG GAGTGTTCAG CGGCGGAAGA GATAGGCTGC
9721 AGCAATCCCT CAACCATCTA TTCACAGCAA TGGACGCCAC GGACGCTGAC GTGACCATCT
9781 ACTGCAGAGA CAAAAGTTGG GAGAAGAAAA TCCAGGAAGC CATTGACATG AGGACGGCTG
9841 TGGAGTTGCT CAATGATGAC GTGGAGCTGA CCACAGACTT GGTGAGAGTG CACCCGGACA
9901 GCAGCCTGGT GGGTCGTAAG GGCTACAGTA CCACTGACGG GTCGCTGTAC TCGTACTTTG
9961 AAGGTACGAA ATTCAACCAG GCTGCTATTG ATATGGCAGA GATACTGACG TTGTGGCCCA
10021 GACTGCAAGA GGCAAACGAA CAGATATGCC TATACGCGCT GGGCGAAACA ATGGACAACA
10081 TCAGATCCAA ATGTCCGGTG AACGATTCCG ATTCATCAAC ACCTCCCAGG ACAGTGCCCT
10141 GCCTGTGCCG CTACGCAATG ACAGCAGAAC GGATCGCCCG CCTTAGGTCA CACCAAGTTA
10201 AAAGCATGGT GGTTTGCTCA TCTTTTCCCC TCCCGAAATA CCATGTAGAT GGGGTGCAGA
10261 AGGTAAAGTG CGAGAAGGTT CTCCTGTTCG ACCCGACGGT ACCTTCAGTG GTTAGTCCGC
10321 GGAAGTATGC CGCATCTACG ACGGACCACT CAGATCGGTC GTTACGAGGG TTTGACTTGG
10381 ACTGGACCAC CGACTCGTCT TCCACTGCCA GCGATACCAT GTCGCTACCC AGTTTGCAGT
10441 CGTGTGACAT CGACTCGATC TACGAGCCAA TGGCTCCCAT AGTAGTGACG GCTGACGTAC
10501 ACCCTGAACC CGCAGGCATC GCGGACCTGG CGGCAGATGT GCACCCTGAA CCCGCAGACC
10561 ATGTGGACCT GGAGAACCCG ATTCCTCCAC CGCGCCCGAA GAGAGCTGCA TACCTTGCCT
10621 CCCGCGCGGC GGAGCGACCG GTGCCGGCGC CGAGAAAGCC GACGCCTGCC CCAAGGACTG
10681 CGTTTAGGAA CAAGCTGCCT TTGACGTTCG GCGACTTTGA CGAGCACGAG GTCGATGCGT
10741 TGGCCTCCGG GATTACTTTC GGAGACTTCG ACGACGTCCT GCGACTAGGC CGCGCGGGTG
10801 CATATATTTT CTCCTCGGAC ACTGGCAGCG GACATTTACA ACAAAAATCC GTTAGGCAGC
10861 ACAATCTCCA GTGCGCACAA CTGGATGCGG TCCAGGAGGA GAAAATGTAC CCGCCAAAAT
10921 TGGATACTGA GAGGGAGAAG CTGTTGCTGC TGAAAATGCA GATGCACCCA TCGGAGGCTA
10981 ATAAGAGTCG ATACCAGTCT CGCAAAGTGG AGAACATGAA AGCCACGGTG GTGGACAGGC
11041 TCACATCGGG GGCCAGATTG TACACGGGAG CGGACGTAGG CCGCATACCA ACATACGCGG
11101 TTCGGTACCC CCGCCCCGTG TACTCCCCTA CCGTGATCGA AAGATTCTCA AGCCCCGATG
11161 TAGCAATCGC AGCGTGCAAC GAATACCTAT CCAGAAATTA CCCAACAGTG GCGTCGTACC
11221 AGATAACAGA TGAATACGAC GCATACTTGG ACATGGTTGA CGGGTCGGAT AGTTGCTTGG
11281 ACAGAGCGAC ATTCTGCCCG GCGAAGCTCC GGTGCTACCC GAAACATCAT GCGTACCACC
11341 AGCCGACTGT ACGCAGTGCC GTCCCGTCAC CCTTTCAGAA CACACTACAG AACGTGCTAG
11401 CGGCTGCCAC CAAGAGAAAC TGCAACGTCA CGCAAATGCG AGAACTACCC ACCATGGACT
11461 CGGCAGTGTT CAACGTGGAG TGCTTCAAGC GCTATGCCTG CTCCGGAGAA TATTGGGAAG
11521 AATATGCTAA ACAACCTATC CGGATAACCA CTGAGAACAT CACTACCTAT GTGACCAAAT
11581 TGAAAGGCCC GAAAGCTGCT GCCTTGTTCG CTAAGACCCA CAACTTGGTT CCGCTGCAGG
11641 AGGTTCCCAT GGACAGATTC ACGGTCGACA TGAAACGAGA TGTCAAAGTC ACTCCAGGGA
11701 CGAAACACAC AGAGGAAAGA CCCAAAGTCC AGGTAATTCA AGCAGCGGAG CCATTGGCGA
11761 CCGCTTACCT GTGCGGCATC CACAGGGAAT TAGTAAGGAG ACTAAATGCT GTGTTACGCC
11821 CTAACGTGCA CACATTGTTT GATATGTCGG CCGAAGACTT TGACGCGATC ATCGCCTCTC
11881 ACTTCCACCC AGGAGACCCG GTTCTAGAGA CGGACATTGC ATCATTCGAC AAAAGCCAGG
11941 ACGACTCCTT GGCTCTTACA GGTTTAATGA TCCTCGAAGA TCTAGGGGTG GATCAGTACC
12001 TGCTGGACTT GATCGAGGCA GCCTTTGGGG AAATATCCAG CTGTCACCTA CCAACTGGCA
12061 CGCGCTTCAA GTTCGGAGCT ATGATGAAAT CGGGCATGTT TCTGACTTTG TTTATTAACA
12121 CTGTTTTGAA CATCACCATA GCAAGCAGGG TACTGGAGCA GAGACTCACT GACTCCGCCT
12181 GTGCGGCCTT CATCGGCGAC GACAACATCG TTCACGGAGT GATCTCCGAC AAGCTGATGG
12241 CGGAGAGGTG CGCGTCGTGG GTCAACATGG AGGTGAAGAT CATTGACGCT GTCATGGGCG
12301 AAAAACCCCC ATATTTTTGT GGGGGATTCA TAGTTTTTGA CAGCGTCACA CAGACCGCCT
12361 GCCGTGTTTC AGACCCACTT AAGCGCCTGT TCAAGTTGGG TAAGCCGCTA ACAGCTGAAG
12421 ACAAGCAGGA CGAAGACAGG CGACGAGCAC TGAGTGACGA GGTT
Figure 30A: pDEST10 Polyhedron Promoter with N-His6, Baculovirus Transfer Plasmid 



154 

aaa taa gta ttt tac tgt ttt cgt aac agt ttt gta ata aaa aaa cct ata 
ttt att cat aaa atg aca aaa gca ttg tca aaa cat tat ttt ttt gga tat 

205 

256 gaa acc 
pDEST10 6708 bp 

Location (Base Nos.)    Gene Encoded 
23..152                 Ppolh 
461..337                attR1 
711..1370               CmR 
1490..1574              inactivated ccdA 
1712..2017              ccdB 
2058..2182              attR2 
3394..4369              ampR 
4510..5164              ori 
5658..62                genR 
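
The locations listed for pDEST10 are 1-based, inclusive base numbers on the sequence that follows. As a minimal sketch, and assuming that an entry whose first number exceeds its second (such as 461..337 for attR1) denotes a feature read on the complementary strand, the corresponding bases could be pulled out of a sequence string as shown here. The helper names `parse_location`, `extract_feature` and `feature_length` are hypothetical and are not part of the disclosure.

```python
# Hypothetical helpers for "start..end" feature locations (1-based, inclusive).
# Assumption: start > end marks a feature on the complementary strand.

COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def parse_location(text):
    """Turn a location such as '461 . .337' into the integer pair (461, 337)."""
    start, end = text.replace(" ", "").split("..")
    return int(start), int(end)


def extract_feature(sequence, start, end):
    """Return the bases of a feature, reverse-complemented when start > end."""
    if start <= end:
        return sequence[start - 1:end]
    top_strand = sequence[end - 1:start]
    return top_strand.translate(COMPLEMENT)[::-1]


def feature_length(start, end):
    """Number of bases spanned by a feature, regardless of strand."""
    return abs(start - end) + 1


if __name__ == "__main__":
    toy = "ATGCCGTA"  # toy sequence, not taken from any plasmid
    print(extract_feature(toy, 2, 4))  # forward feature
    print(extract_feature(toy, 4, 2))  # same span read on the other strand
    print(feature_length(*parse_location("461 . .337")))
```

Under this reading, the attR1 entries of the different plasmids span the same number of bases as the attR2 entries, which is one way to sanity-check a scanned table.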



   1 CCCCGGATGA AGTGGTTCGC ATCCTCGGTT TTCTGGAAGG CGAGCATCGT TTGTTCGCCC 
  61 AGGACTCTAG CTATAGTTCT AGTGGTTGGC TACGTATACT CCGGAATATT AATAGATCAT 
 121 GGAGATAATT AAAATGATAA CCATCTCGCA AATAAATAAG TATTTTACTG TTTTCGTAAC 
 181 AGTTTTGTAA TAAAAAAACC TATAAATATT CCGGATTATT CATACCGTCC CACCATCGGG 
 241 CGCGGATCTC GGTCCGAAAC CATGTCGTAC TACCATCACC ATCACCATCA CGATTACGAT 
 301 ATCCCAACGA CCGAAAACCT GTATTTTCAG GGCATCACAA GTTTGTACAA AAAAGCTGAA 
 361 CGAGAAACGT AAAATGATAT AAATATCAAT ATATTAAATT AGATTTTGCA TAAAAAACAG 
 421 ACTACATAAT ACTGTAAAAC ACAACATATC CAGTCACTAT GGCGGCCGCT AAGTTGGCAG 
 481 CATCACCCGA CGCACTTTGC GCCGAATAAA TACCTGTGAC GGAAGATCAC TTCGCAGAAT 
 541 AAATAAATCC TGGTGTCCCT GTTGATACCG GGAAGCCCTG GGCCAACTTT TGGCGAAAAT 
 601 GAGACGTTGA TCGGCACGTA AGAGGTTCCA ACTTTCACCA TAATGAAATA AGATCACTAC 
 661 CGGGCGTATT TTTTGAGTTA TCGAGATTTT CAGGAGCTAA GGAAGCTAAA ATGGAGAAAA 
 721 AAATCACTGG ATATACCACC GTTGATATAT CCCAATGGCA TCGTAAAGAA CATTTTGAGG 
 781 CATTTCAGTC AGTTGCTCAA TGTACCTATA ACCAGACCGT TCAGCTGGAT ATTACGGCCT 
 841 TTTTAAAGAC CGTAAAGAAA AATAAGCACA AGTTTTATCC GGCCTTTATT CACATTCTTG 
 901 CCCGCCTGAT GAATGCTCAT CCGGAATTCC GTATGGCAAT GAAAGACGGT GAGCTGGTGA 
 961 TATGGGATAG TGTTCACCCT TGTTACACCG TTTTCCATGA GCAAACTGAA ACGTTTTCAT 
1021 CGCTCTGGAG TGAATACCAC GACGATTTCC GGCAGTTTCT ACACATATAT TCGCAAGATG 
1081 TGGCGTGTTA CGGTGAAAAC CTGGCCTATT TCCCTAAAGG GTTTATTGAG AATATGTTTT 
1141 TCGTCTCAGC CAATCCCTGG GTGAGTTTCA CCAGTTTTGA TTTAAACGTG GCCAATATGG 
1201 ACAACTTCTT CGCCCCCGTT TTCACCATGG GCAAATATTA TACGCAAGGC GACAAGGTGC 
1261 TGATGCCGCT GGCGATTCAG GTTCATCATG CCGTCTGTGA TGGCTTCCAT GTCGGCAGAA 
1321 TGCTTAATGA ATTACAACAG TACTGCGATG AGTGGCAGGG CGGGGCGTAA ACGCGTGGAT 
1381 CCGGCTTACT AAAAGCCAGA TAACAGTATG CGTATTTGCG CGCTGATTTT TGCGGTATAA 
1441 GAATATATAC TGATATGTAT ACCCGAAGTA TGTCAAAAAG AGGTGTGCTA TGAAGCAGCG 
1501 TATTACAGTG ACAGTTGACA GCGACAGCTA TCAGTTGCTC AAGGCATATA TGATGTCAAT 
1561 ATCTCCGGTC TGGTAAGCAC AACCATGCAG AATGAAGCCC GTCGTCTGCG TGCCGAACGC 
1621 TGGAAAGCGG AAAATCAGGA AGGGATGGCT GAGGTCGCCC GGTTTATTGA AATGAACGGC 
1681 TCTTTTGCTG ACGAGAACAG GGACTGGTGA AATGCAGTTT AAGGTTTACA CCTATAAAAG 
1741 AGAGAGCCGT TATCGTCTGT TTGTGGATGT ACAGAGTGAT ATTATTGACA CGCCCGGGCG 
1801 ACGGATGGTG ATCCCCCTGG CCAGTGCACG TCTGCTGTCA GATAAAGTCT CCCGTGAACT 
1861 TTACCCGGTG GTGCATATCG GGGATGAAAG CTGGCGCATG ATGACCACCG ATATGGCCAG 
1921 TGTGCCGGTC TCCGTTATCG GGGAAGAAGT GGCTGATCTC AGCCACCGCG AAAATGACAT 
1981 CAAAAACGCC ATTAACCTGA TGTTCTGGGG AATATAAATG TCAGGCTCCC TTATACACAG 
2041 CCAGTCTGCA GGTCGACCAT AGTGACTGGA TATGTTGTGT TTTACAGTAT TATGTAGTCT 
2101 GTTTTTTATG CAAAATCTAA TTTAATATAT TGATATTTAT ATCATTTTAC GTTTCTCGTT 
2161 CAGCTTTCTT GTACAAAGTG GTGATGCCAT GGATCCGGAA TTCAAAGGCC TACGTCGACG 
2221 AGCTCAACTA GTGCGGCCGC TTTCGAATCT AGAGCCTGCA GTCTCGAGGC ATGCGGTACC 
2281 AAGCTTGTCG AGAAGTACTA GAGGATCATA ATCAGCCATA CCACATTTGT AGAGGTTTTA 
2341 CTTGCTTTAA AAAACCTCCC ACACCTCCCC CTGAACCTGA AACATAAAAT GAATGCAATT 
2401 GTTGTTGTTA ACTTGTTTAT TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA 
2461 AATTTCACAA ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC 
2521 AATGTATCTT ATCATGTCTG GATCTGATCA CTGCTTGAGC CTAGGAGATC CGAACCAGAT 
2581 AAGTGAAATC TAGTTCCAAA CTATTTTGTC ATTTTTAATT TTCGTATTAG CTTACGACGC 
2641 TACACCCAGT TCCCATCTAT TTTGTCACTC TTCCCTAAAT AATCCTTAAA AACTCCATTT 
2701 CCACCCCTCC CAGTTCCCAA CTATTTTGTC CGCCCACAGC GGGGCATTTT TCTTCCTGTT 
2761 ATGTTTTTAA TCAAACATCC TGCCAACTCC ATGTGACAAA CCGTCATCTT CGGCTACTTT 
2821 TTCTCTGTCA CAGAATGAAA ATTTTTCTGT CATCTCTTCG TTATTAATGT TTGTAATTGA 
2881 CTGAATATCA ACGCTTATTT GCAGCCTGAA TGGCGAATGG GACGCGCCCT GTAGCGGCGC 
2941 ATTAAGCGCG GCGGGTGTGG TGGTTACGCG CAGCGTGACC GCTACACTTG CCAGCGCCCT 
3001 AGCGCCCGCT CCTTTCGCTT TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG GCTTTCCCCG 
3061 TCAAGCTCTA AATCGGGGGC TCCCTTTAGG GTTCCGATTT AGTGCTTTAC GGCACCTCGA 
3121 CCCCAAAAAA CTTGATTAGG GTGATGGTTC ACGTAGTGGG CCATCGCCCT GATAGACGGT 
3181 TTTTCGCCCT TTGACGTTGG AGTCCACGTT CTTTAATAGT GGACTCTTGT TCCAAACTGG 
3241 AACAACACTC AACCCTATCT CGGTCTATTC TTTTGATTTA TAAGGGATTT TGCCGATTTC 
3301 GGCCTATTGG TTAAAAAATG AGCTGATTTA ACAAAAATTT AACGCGAATT TTAACAAAAT 
3361 ATTAACGTTT ACAATTTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG 
3421 TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT 
3481 GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT 
3541 TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT 
3601 AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG 
3661 CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA 
3721 AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGGTCG 
3781 CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT 
3841 TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC 
3901 TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA 
3961 CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT 
4021 ACCAAACGAC GAGCGTGACA CCACGATGCC TGTAGCAATG GCAACAACGT TGCGCAAACT 
4081 ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC 
4141 GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA 
4201 TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG 
4261 TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG 
4321 AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA 
4381 AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA 
4441 GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA 
4501 CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG 
4561 CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA 
4621 TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA 
4681 TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC 
4741 TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG 
4801 TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC 
4861 GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT 
4921 ACAGCGTGAG CATTGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC 
4981 GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG 
5041 GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG 
5101 CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT 
5161 GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA 
5221 TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG 
5281 CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA 
5341 TCTGTGCGGT ATTTCACACC GCAGACCAGC CGCGTAACCT GGCAAAATCG GTTACGGTTG 
5401 AGTAATAAAT GGATGCCCTG CGTAAGCGGG TGTGGGCGGA CAATAAAGTC TTAAACTGAA 
5461 CAAAATAGAT CTAAACTATG ACAATAAAGT CTTAAACTAG ACAGAATAGT TGTAAACTGA 
5521 AATCAGTCCA GTTATGCTGT GAAAAAGCAT ACTGGACTTT TGTTATGGCT AAAGCAAACT 
5581 CTTCATTTTC TGAAGTGCAA ATTGCCCGTC GTATTAAAGA GGGGCGTGGC CAAGGGCATG 
5641 GTAAAGACTA TATTCGCGGC GTTGTGACAA TTTACCGAAC AACTCCGCGG CCGGGAAGCC 
5701 GATCTCGGCT TGAACGAATT GTTAGGTGGC GGTACTTGGG TCGATATCAA AGTGCATCAC 
5761 TTCTTCCCGT ATGCCCAACT TTGTATAGAG AGCCACTGCG GGATCGTCAC CGTAATCTGC 
5821 TTGCACGTAG ATCACATAAG CACCAAGCGC GTTGGCCTCA TGCTTGAGGA GATTGATGAG 
5881 CGCGGTGGCA ATGCCCTGCC TCCGGTGCTC GCCGGAGACT GCGAGATCAT AGATATAGAT 
5941 CTCACTACGC GGCTGCTCAA ACCTGGGCAG AACGTAAGCC GCGAGAGCGC CAACAACCGC 
6001 TTCTTGGTCG AAGGCAGCAA GCGCGATGAA TGTCTTACTA CGGAGCAAGT TCCCGAGGTA 
6061 ATCGGAGTCC GGCTGATGTT GGGAGTAGGT GGCTACGTCT CCGAACTCAC GACCGAAAAG 
6121 ATCAAGAGCA GCCCGCATGG ATTTGACTTG GTCAGGGCCG AGCCTACATG TGCGAATGAT 
6181 GCCCATACTT GAGCCACCTA ACTTTGTTTT AGGGCGACTG CCCTGCTGCG TAACATCGTT 
6241 GCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCAAACA TCGACCCACG GCGTAACGCG 
6301 CTTGCTGCTT GGATGCCCGA GGCATAGACT GTACAAAAAA ACAGTCATAA CAAGCCATGA 
6361 AAACCGCCAC TGCGCCGTTA CCACCGCTGC GTTCGGTCAA GGTTCTGGAC CAGTTGCGTG 
6421 AGCGCATACG CTACTTGCAT TACAGTTTAC GAACCGAACA GGCTTATGTC AACTGGGTTC 
6481 GTGCCTTCAT CCGTTTCCAC GGTGTGCGTC ACCCGGCAAC CTTGGGCAGC AGCGAAGTCG 
6541 AGGCATTTCT GTCCTGGCTG GCGAACGAGC GCAAGGTTTC GGTCTCCACG CATCGTCAGG 
6601 CATTGGCGGC CTTGCTGTTC TTCTACGGCA AGGTGCTGTG CACGGATCTG CCCTGGCTTC 
6661 AGGAGATCGG AAGACCTCGG CCGTCGCGGC GCTTGCCGGT GGTGCTGA 



Figure 31A: pDEST11 Tet-regulated eukaryotic expression 
pDEST11 7026 bp 

Location (Base Nos.)    Gene Encoded 
4..479                  Tetp ((Tet operator)7 and min hCMV promoter) 
638..514                attR1 
888..1547               CmR 
1667..1751              inactivated ccdA 
1889..2194              ccdB 
2235..2359              attR2 
2402..4132              polyA 
4347..4803              f1 ori 
4940..5797              ampR 



   1 CGAGTTTACC ACTCCCTATC AGTGATAGAG AAAAGTGAAA GTCGAGTTTA CCACTCCCTA 
  61 TCAGTGATAG AGAAAAGTGA AAGTCGAGTT TACCACTCCC TATCAGTGAT AGAGAAAAGT 
 121 GAAAGTCGAG TTTACCACTC CCTATCAGTG ATAGAGAAAA GTGAAAGTCG AGTTTACCAC 
 181 TCCCTATCAG TGATAGAGAA AAGTGAAAGT CGAGTTTACC ACTCCCTATC AGTGATAGAG 
 241 AAAAGTGAAA GTCGAGTTTA CCACTCCCTA TCAGTGATAG AGAAAAGTGA AAGTCGAGCT 
 301 CGGTACCCGG GTCGAGTAGG CGTGTACGGT GGGAGGCCTA TATAAGCAGA GCTCGTTTAG 
 361 TGAACCGTCA GATCGCCTGG AGACGCCATC CACGCTGTTT TGACCTCCAT AGAAGACACC 
 421 GGGACCGATC CAGCCTCCGC GGCCCCGAAT TCGAGCTCGG TACCCGGGGA TCCTCTAGAG 
 481 TCGAGGTCGA CGGTATCGAT AAGCTTGATA TCAACAAGTT TGTACAAAAA AGCTGAACGA 
 541 GAAACGTAAA ATGATATAAA TATCAATATA TTAAATTAGA TTTTGCATAA AAAACAGACT 
 601 ACATAATACT GTAAAACACA ACATATCCAG TCACTATGGC GGCCGCTAAG TTGGCAGCAT 
 661 CACCCGACGC ACTTTGCGCC GAATAAATAC CTGTGACGGA AGATCACTTC GCAGAATAAA 
 721 TAAATCCTGG TGTCCCTGTT GATACCGGGA AGCCCTGGGC CAACTTTTGG CGAAAATGAG 
 781 ACGTTGATCG GCACGTAAGA GGTTCCAACT TTCACCATAA TGAAATAAGA TCACTACCGG 
 841 GCGTATTTTT TGAGTTATCG AGATTTTCAG GAGCTAAGGA AGCTAAAATG GAGAAAAAAA 
 901 TCACTGGATA TACCACCGTT GATATATCCC AATGGCATCG TAAAGAACAT TTTGAGGCAT 
 961 TTCAGTCAGT TGCTCAATGT ACCTATAACC AGACCGTTCA GCTGGATATT ACGGCCTTTT 
1021 TAAAGACCGT AAAGAAAAAT AAGCACAAGT TTTATCCGGC CTTTATTCAC ATTCTTGCCC 
1081 GCCTGATGAA TGCTCATCCG GAATTCCGTA TGGCAATGAA AGACGGTGAG CTGGTGATAT 
1141 GGGATAGTGT TCACCCTTGT TACACCGTTT TCCATGAGCA AACTGAAACG TTTTCATCGC 
1201 TCTGGAGTGA ATACCACGAC GATTTCCGGC AGTTTCTACA CATATATTCG CAAGATGTGG 
1261 CGTGTTACGG TGAAAACCTG GCCTATTTCC CTAAAGGGTT TATTGAGAAT ATGTTTTTCG 
1321 TCTCAGCCAA TCCCTGGGTG AGTTTCACCA GTTTTGATTT AAACGTGGCC AATATGGACA 
1381 ACTTCTTCGC CCCCGTTTTC ACCATGGGCA AATATTATAC GCAAGGCGAC AAGGTGCTGA 
1441 TGCCGCTGGC GATTCAGGTT CATCATGCCG TCTGTGATGG CTTCCATGTC GGCAGAATGC 
1501 TTAATGAATT ACAACAGTAC TGCGATGAGT GGCAGGGCGG GGCGTAAAGA TCTGGATCCG 
1561 GCTTACTAAA AGCCAGATAA CAGTATGCGT ATTTGCGCGC TGATTTTTGC GGTATAAGAA 
1621 TATATACTGA TATGTATACC CGAAGTATGT CAAAAAGAGG TGTGCTATGA AGCAGCGTAT 
1681 TACAGTGACA GTTGACAGCG ACAGCTATCA GTTGCTCAAG GCATATATGA TGTCAATATC 
1741 TCCGGTCTGG TAAGCACAAC CATGCAGAAT GAAGCCCGTC GTCTGCGTGC CGAACGCTGG 
1801 AAAGCGGAAA ATCAGGAAGG GATGGCTGAG GTCGCCCGGT TTATTGAAAT GAACGGCTCT 
1861 TTTGCTGACG AGAACAGGGA CTGGTGAAAT GCAGTTTAAG GTTTACACCT ATAAAAGAGA 
1921 GAGCCGTTAT CGTCTGTTTG TGGATGTACA GAGTGATATT ATTGACACGC CCGGGCGACG 
1981 GATGGTGATC CCCCTGGCCA GTGCACGTCT GCTGTCAGAT AAAGTCTCCC GTGAACTTTA 
2041 CCCGGTGGTG CATATCGGGG ATGAAAGCTG GCGCATGATG ACCACCGATA TGGCCAGTGT 
2101 GCCGGTCTCC GTTATCGGGG AAGAAGTGGC TGATCTCAGC CACCGCGAAA ATGACATCAA 
2161 AAACGCCATT AACCTGATGT TCTGGGGAAT ATAAATGTCA GGCTCCCTTA TACACAGCCA 
2221 GTCTGCAGGT CGACCATAGT GACTGGATAT GTTGTGTTTT ACAGTATTAT GTAGTCTGTT 
2281 TTTTATGCAA AATCTAATTT AATATATTGA TATTTATATC ATTTTACGTT TCTCGTTCAG 
2341 CTTTCTTGTA CAAAGTGGTT GATATCGAAT TCCTGCAGCC CGGGGGATCC ACTAGTTCTA 
2401 GAGCACTGCG ATGAGTGGCA GGGCGGGGCG TAATTTTTTT AAGGCAGTTA TTGGTGCCCT 
2461 TAAACGCCTG GTGCTACGCC TGAATAAGTG ATAATAAGCG GATGAATGGC AGAAATTCGC 
2521 CGGATCTTTG TGAAGGAACC TTACTTCTGT GGTGTGACAT AATTGGACAA ACTACCTACA 
2581 GAGATTTAAA GCTCTAAGGT AAATATAAAA TTTTTAAGTG TATAATGTGT TAAACTACTG 
2641 ATTCTAATTG TTTGTGTATT TTAGATTCCA ACCTATGGAA CTGATGAATG GGAGCAGTGG 
2701 TGGAATGCCT TTAATGAGGA AAACCTGTTT TGCTCAGAAG AAATGCCATC TAGTGATGAT 
2761 GAGGCTACTG CTGACTCTCA ACATTCTACT CCTCCAAAAA AGAAGAGAAA GGTAGAAGAC 
2821 CCCAAGGACT TTCCTTCAGA ATTGCTAAGT TTTTTGAGTC ATGCTGTGTT TAGTAATAGA 
2881 ACTCTTGCTT GCTTTGCTAT TTACACCACA AAGGAAAAAG CTGCACTGCT ATACAAGAAA 
2941 ATTATGGAAA AATATTCTGT AACCTTTATA AGTAGGCATA ACAGTTATAA TCATAACATA 
3001 CTGTTTTTTC TTACTCCACA CAGGCATAGA GTGTCTGCTA TTAATAACTA TGCTCAAAAA 
3061 TTGTGTACCT TTAGCTTTTT AATTTGTAAA GGGGTTAATA AGGAATATTT GATGTATAGT 
3121 GCCTTGACTA GAGATCATAA TCAGCCATAC CACATTTGTA GAGGTTTTAC TTGCTTTAAA 
3181 AAACCTCCCA CACCTCCCCC TGAACCTGAA ACATAAAATG AATGCAATTG TTGTTGTTAA 
3241 CTTGTTTATT GCAGCTTATA ATGGTTACAA ATAAAGCAAT AGCATCACAA ATTTCACAAA 
3301 TAAAGCATTT TTTTCACTGC ATTCTAGTTG TGGTTTGTCC AAACTCATCA ATGTATCTTA 
3361 TCATGTCTGG ATCCCCAGGA AGCTCCTCTG TGTCCTCATA AACCCTAACC TCCTCTACTT 
3421 GAGAGGACAT TCCAATCATA GGCTGCCCAT CCACCCTCTG TGTCCTCCTG TTAATTAGGT 
3481 CACTTAACAA AAAGGAAATT GGGTAGGGGT TTTTCACAGA CCGCTTTCTA AGGGTAATTT 
3541 TAAAATATCT GGGAAGTCCC TTCCACTGCT GTGTTCCAGA AGTGTTGGTA AACAGCCCAC 
3601 AAATGTCAAC AGCAGAAACA TACAAGCTGT CAGCTTTGCA CAAGGGCCCA ACACCCTGCT 
3661 CATCAAGAAG CACTGTGGTT GCTGTGTTAG TAATGTGCAA AACAGGAGGC ACATTTTCCC 
3721 CACCTGTGTA GGTTCCAAAA TATCTAGTGT TTTCATTTTT ACTTGGATCA GGAACCCAGC 
3781 ACTCCACTGG ATAAGCATTA TCCTTATCCA AAACAGCCTT GTGGTCAGTG TTCATCTGCT 
3841 GACTGTCAAC TGTAGCATTT TTTGGGGTTA CAGTTTGAGC AGGATATTTG GTCCTGTAGT 
3901 TTGCTAACAC ACCCTGCAGC TCCAAAGGTT CCCCACCAAC AGCAAAAAAA TGAAAATTTG 
3961 ACCCTTGAAT GGGTTTTCCA GCACCATTTT CATGAGTTTT TTGTGTCCCT GAATGCAAGT 
4021 TTAACATAGC AGTTACCCCA ATAACCTCAG TTTTAACAGT AACAGCTTCC CACATCAAAA 
4081 TATTTCCACA GGTTAAGTCC TCATTTAAAT TAGGCAAAGG AATTGCTCTA GAGCGGCCGC 
4141 CACCGCGGTG GAGCTCCAAT TCGCCCTATA GTGAGTCGTA TTACGCGCGC TCACTGGCCG 
4201 TCGTTTTACA ACGTCGTGAC TGGGAAAACC CTGGCGTTAC CCAACTTAAT CGCCTTGCAG 
4261 CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT CGCCCTTCCC 
4321 AACAGTTGCG CAGCCTGAAT GGCGAATGGG ACGCGCCCTG TAGCGGCGCA TTAAGCGCGG 
4381 CGGGTGTGGT GGTTACGCGC AGCGTGACCG CTACACTTGC CAGCGCCCTA GCGCCCGCTC 
4441 CTTTCGCTTT CTTCCCTTCC TTTCTCGCCA CGTTCGCCGG CTTTCCCCGT CAAGCTCTAA 
4501 ATCGGGGGCT CCCTTTAGGG TTCCGATTTA GTGCTTTACG GCACCTCGAC CCCAAAAAAC 
4561 TTGATTAGGG TGATGGTTCA CGTAGTGGGC CATCGCCCTG ATAGACGGTT TTTCGCCCTT 
4621 TGACGTTGGA GTCCACGTTC TTTAATAGTG GACTCTTGTT CCAAACTGGA ACAACACTCA 
4681 ACCCTATCTC GGTCTATTCT TTTGATTTAT AAGGGATTTT GCCGATTTCG GCCTATTGGT 
4741 TAAAAAATGA GCTGATTTAA CAAAAATTTA ACGCGAATTT TAACAAAATA TTAACGCTTA 
4801 CAATTTAGGT GGCACTTTTC GGGGAAATGT GCGCGGAACC CCTATTTGTT TATTTTTCTA 
4861 AATACATTCA AATATGTATC CGCTCATGAG ACAATAACCC TGATAAATGC TTCAATAATA 
4921 TTGAAAAAGG AAGAGTATGA GTATTCAACA TTTCCGTGTC GCCCTTATTC CCTTTTTTGC 
4981 GGCATTTTGC CTTCCTGTTT TTGCTCACCC AGAAACGCTG GTGAAAGTAA AAGATGCTGA 
5041 AGATCAGTTG GGTGCACGAG TGGGTTACAT CGAACTGGAT CTCAACAGCG GTAAGATCCT 
5101 TGAGAGTTTT CGCCCCGAAG AACGTTTTCC AATGATGAGC ACTTTTAAAG TTCTGCTATG 
5161 TGGCGCGGTA TTATCCCGTA TTGACGCCGG GCAAGAGCAA CTCGGTCGCC GCATACACTA 
5221 TTCTCAGAAT GACTTGGTTG AGTACTCACC AGTCACAGAA AAGCATCTTA CGGATGGCAT 
5281 GACAGTAAGA GAATTATGCA GTGCTGCCAT AACCATGAGT GATAACACTG CGGCCAACTT 
5341 ACTTCTGACA ACGATCGGAG GACCGAAGGA GCTAACCGCT TTTTTGCACA ACATGGGGGA 
5401 TCATGTAACT CGCCTTGATC GTTGGGAACC GGAGCTGAAT GAAGCCATAC CAAACGACGA 
5461 GCGTGACACC ACGATGCCTG TAGCAATGGC AACAACGTTG CGCAAACTAT TAACTGGCGA 
5521 ACTACTTACT CTAGCTTCCC GGCAACAATT AATAGACTGG ATGGAGGCGG ATAAAGTTGC 
5581 AGGACCACTT CTGCGCTCGG CCCTTCCGGC TGGCTGGTTT ATTGCTGATA AATCTGGAGC 
5641 CGGTGAGCGT GGGTCTCGCG GTATCATTGC AGCACTGGGG CCAGATGGTA AGCCCTCCCG 
5701 TATCGTAGTT ATCTACACGA CGGGGAGTCA GGCAACTATG GATGAACGAA ATAGACAGAT 
5761 CGCTGAGATA GGTGCCTCAC TGATTAAGCA TTGGTAACTG TCAGACCAAG TTTACTCATA 
5821 TATACTTTAG ATTGATTTAA AACTTCATTT TTAATTTAAA AGGATCTAGG TGAAGATCCT 
5881 TTTTGATAAT CTCATGACCA AAATCCCTTA ACGTGAGTTT TCGTTCCACT GAGCGTCAGA 
5941 CCCCGTAGAA AAGATCAAAG GATCTTCTTG AGATCCTTTT TTTCTGCGCG TAATCTGCTG 
6001 CTTGCAAACA AAAAAACCAC CGCTACCAGC GGTGGTTTGT TTGCCGGATC AAGAGCTACC 
6061 AACTCTTTTT CCGAAGGTAA CTGGCTTCAG CAGAGCGCAG ATACCAAATA CTGTCCTTCT 
6121 AGTGTAGCCG TAGTTAGGCC ACCACTTCAA GAACTCTGTA GCACCGCCTA CATACCTCGC 
6181 TCTGCTAATC CTGTTACCAG TGGCTGCTGC CAGTGGCGAT AAGTCGTGTC TTACCGGGTT 
6241 GGACTCAAGA CGATAGTTAC CGGATAAGGC GCAGCGGTCG GGCTGAACGG GGGGTTCGTG 
6301 CACACAGCCC AGCTTGGAGC GAACGACCTA CACCGAACTG AGATACCTAC AGCGTGAGCT 
6361 ATGAGAAAGC GCCACGCTTC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG TAAGCGGCAG 
6421 GGTCGGAACA GGAGAGCGCA CGAGGGAGCT TCCAGGGGGA AACGCCTGGT ATCTTTATAG 
6481 TCCTGTCGGG TTTCGCCACC TCTGACTTGA GCGTCGATTT TTGTGATGCT CGTCAGGGGG 
6541 GCGGAGCCTA TGGAAAAACG CCAGCAACGC GGCCTTTTTA CGGTTCCTGG CCTTTTGCTG 
6601 GCCTTTTGCT CACATGTTCT TTCCTGCGTT ATCCCCTGAT TCTGTGGATA ACCGTATTAC 
6661 CGCCTTTGAG TGAGCTGATA CCGCTCGCCG CAGCCGAACG ACCGAGCGCA GCGAGTCAGT 
6721 GAGCGAGGAA GCGGAAGAGC GCCCAATACG CAAACCGCCT CTCCCCGCGC GTTGGCCGAT 
6781 TCATTAATGC AGCTGGCACG ACAGGTTTCC CGACTGGAAA GCGGGCAGTG AGCGCAACGC 
6841 AATTAATGTG AGTTAGCTCA CTCATTAGGC ACCCCAGGCT TTACACTTTA TGCTTCCGGC 
6901 TCGTATGTTG TGTGGAATTG TGAGCGGATA ACAATTTCAC ACAGGAAACA GCTATGACCA 
6961 TGATTACGCC AAGCGCGCAA TTAACCCTCA CTAAAGGGAA CAAAAGCTGG GTACCGGGCC 
7021 CCCCCT 
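
Each sequence row in these listings gives the 1-based number of its first base followed by blocks of ten bases, so the row numbering can be checked against the plasmid size stated with the feature table. The following is a minimal sketch of such a check. The function name `read_rows` and its `first_position` parameter are assumptions made for illustration, and the two sample rows are the last two rows of the pDEST11 listing.

```python
import re


def read_rows(rows, first_position=1):
    """Join numbered sequence rows and verify that each row number is consistent.

    Returns the concatenated bases and the number of the last base read.
    """
    expected = first_position
    pieces = []
    for row in rows:
        fields = row.split()
        if not fields:
            continue  # skip blank lines between rows
        position = int(fields[0])
        bases = "".join(fields[1:]).upper()
        if not re.fullmatch(r"[ACGT]+", bases):
            raise ValueError(f"unexpected character in row {position}")
        if position != expected:
            raise ValueError(f"row labelled {position}, expected {expected}")
        pieces.append(bases)
        expected += len(bases)
    return "".join(pieces), expected - 1


if __name__ == "__main__":
    rows = [
        "6961 TGATTACGCC AAGCGCGCAA TTAACCCTCA CTAAAGGGAA CAAAAGCTGG GTACCGGGCC",
        "7021 CCCCCT",
    ]
    bases, last = read_rows(rows, first_position=6961)
    print(len(bases), last)
```

For these two rows the last base read is number 7026, which agrees with the size given for pDEST11 (7026 bp). A row whose label does not match the running count raises an error, which is a quick way to spot a dropped or duplicated block in a scanned listing.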
Figure 32A: pDEST12.2 CMV Promoter for Eukaryotic Expression, SV40 Promoter/ori for G418 Resistance 
Restriction sites marked on the plasmid map: Nru I 1, Bbs I 1581, Eam1105 I 6257 
pDEST12.2 7278 bp (rotated to position 3900) 

Location (Base Nos.)    Gene Encoded 
86..136                 ori 
220..742                CMV promoter 
1059..935               attR1 
1168..1827              CmR 
1947..2031              inactivated ccdA 
2169..2474              ccdB 
2515..2639              attR2 
2824..3186              small t & polyA 
3310..3378              lac 
4363..5157              neo 
5680..6540              ampR 



   1 GGGGGGCGGA GCCTATGGAA AAACGCCAGC AACGCGGCCT TTTTACGGTT CCTGGCCTTT 
  61 TGCTGGCCTT TTGCTCACAT GTTCTTTCCT GCGTTATCCC CTGATTCTGT GGATAACCGT 
 121 ATTACCGCCT TTGAGTGAGC TGATACCGCT CGCCGCAGCC GAACGACCGA GCGCAGCGAG 
 181 TCAGTGAGCG AGGAAGCGGA AGAGCTCGCG AATGCATGTC GTTACATAAC TTACGGTAAA 
 241 TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA TGACGTATGT 
 301 TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA TGGGTGGAGT ATTTACGGTA 
 361 AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA AGTACGCCCC CTATTGACGT 
 421 CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC ATGACCTTAT GGGACTTTCC 
 481 TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC ATGGTGATGC GGTTTTGGCA 
 541 GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA TTTCCAAGTC TCCACCCCAT 
 601 TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG GACTTTCCAA AATGTCGTAA 
 661 CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA CGGTGGGAGG TCTATATAAG 
 721 CAGAGCTCGT TTAGTGAACC GTCAGATCGC CTGGAGACGC CATCCACGCT GTTTTGACCT 
 781 CCATAGAAGA CACCGGGACC GATCCAGCCT CCGGACTCTA GCCTAGGCCG CGGGACGGAT 
 841 AACAATTTCA CACAGGAAAC AGCTATGACC ATTAGGCCTT TGCAAAAAGC TATTTAGGTG 
 901 ACACTATAGA AGGTACGCCT GCAGGTACCG GATCACAAGT TTGTACAAAA AAGCTGAACG 
 961 AGAAACGTAA AATGATATAA ATATCAATAT ATTAAATTAG ATTTTGCATA AAAAACAGAC 
1021 TACATAATAC TGTAAAACAC AACATATCCA GTCACTATGG CGGCCGCATT AGGCACCCCA 
1081 GGCTTTACAC TTTATGCTTC CGGCTCGTAT AATGTGTGGA TTTTGAGTTA GGATCCGTCG 
1141 AGATTTTCAG GAGCTAAGGA AGCTAAAATG GAGAAAAAAA TCACTGGATA TACCACCGTT 
1201 GATATATCCC AATGGCATCG TAAAGAACAT TTTGAGGCAT TTCAGTCAGT TGCTCAATGT 
1261 ACCTATAACC AGACCGTTCA GCTGGATATT ACGGCCTTTT TAAAGACCGT AAAGAAAAAT 
1321 AAGCACAAGT TTTATCCGGC CTTTATTCAC ATTCTTGCCC GCCTGATGAA TGCTCATCCG 
1381 GAATTCCGTA TGGCAATGAA AGACGGTGAG CTGGTGATAT GGGATAGTGT TCACCCTTGT 
1441 TACACCGTTT TCCATGAGCA AACTGAAACG TTTTCATCGC TCTGGAGTGA ATACCACGAC 
1501 GATTTCCGGC AGTTTCTACA CATATATTCG CAAGATGTGG CGTGTTACGG TGAAAACCTG 
1561 GCCTATTTCC CTAAAGGGTT TATTGAGAAT ATGTTTTTCG TCTCAGCCAA TCCCTGGGTG 
1621 AGTTTCACCA GTTTTGATTT AAACGTGGCC AATATGGACA ACTTCTTCGC CCCCGTTTTC 
1681 ACCATGGGCA AATATTATAC GCAAGGCGAC AAGGTGCTGA TGCCGCTGGC GATTCAGGTT 
1741 CATCATGCCG TCTGTGATGG CTTCCATGTC GGCAGAATGC TTAATGAATT ACAACAGTAC 
1801 TGCGATGAGT GGCAGGGCGG GGCGTAAACG CGTGGATCCG GCTTACTAAA AGCCAGATAA 
1861 CAGTATGCGT ATTTGCGCGC TGATTTTTGC GGTATAAGAA TATATACTGA TATGTATACC 
1921 CGAAGTATGT CAAAAAGAGG TGTGCTATGA AGCAGCGTAT TACAGTGACA GTTGACAGCG 
1981 ACAGCTATCA GTTGCTCAAG GCATATATGA TGTCAATATC TCCGGTCTGG TAAGCACAAC 
2041 CATGCAGAAT GAAGCCCGTC GTCTGCGTGC CGAACGCTGG AAAGCGGAAA ATCAGGAAGG 
2101 GATGGCTGAG GTCGCCCGGT TTATTGAAAT GAACGGCTCT TTTGCTGACG AGAACAGGGA 
2161 CTGGTGAAAT GCAGTTTAAG GTTTACACCT ATAAAAGAGA GAGCCGTTAT CGTCTGTTTG 
2221 TGGATGTACA GAGTGATATT ATTGACACGC CCGGGCGACG GATGGTGATC CCCCTGGCCA 
2281 GTGCACGTCT GCTGTCAGAT AAAGTCTCCC GTGAACTTTA CCCGGTGGTG CATATCGGGG 
2341 ATGAAAGCTG GCGCATGATG ACCACCGATA TGGCCAGTGT GCCGGTCTCC GTTATCGGGG 
2401 AAGAAGTGGC TGATCTCAGC CACCGCGAAA ATGACATCAA AAACGCCATT AACCTGATGT 
2461 TCTGGGGAAT ATAAATGTCA GGCTCCCTTA TACACAGCCA GTCTGCAGGT CGACCATAGT 
2521 GACTGGATAT GTTGTGTTTT ACAGTATTAT GTAGTCTGTT TTTTATGCAA AATCTAATTT 
2581 AATATATTGA TATTTATATC ATTTTACGTT TCTCGTTCAG CTTTCTTGTA CAAAGTGGTG 
2641 ATCGCGTGCA TGCGACGTCA TAGCTCTCTC CCTATAGTGA GTCGTATTAT AAGCTAGGCA 
2701 CTGGCCGTCG TTTTACAACG TCGTGACTGG GAAAACTGCT AGCTTGGGAT CTTTGTGAAG 
2761 GAACCTTACT TCTGTGGTGT GACATAATTG GACAAACTAC CTACAGAGAT TTAAAGCTCT 
2821 AAGGTAAATA TAAAATTTTT AAGTGTATAA TGTGTTAAAC TAGCTGCATA TGCTTGCTGC 
2881 TTGAGAGTTT TGCTTACTGA GTATGATTTA TGAAAATATT ATACACAGGA GCTAGTGATT 
2941 CTAATTGTTT GTGTATTTTA GATTCACAGT CCCAAGGCTC ATTTCAGGCC CCTCAGTCCT 
3001 CACAGTCTGT TCATGATCAT AATCAGCCAT ACCACATTTG TAGAGGTTTT ACTTGCTTTA 
3061 AAAAACCTCC CACACCTCCC CCTGAACCTG AAACATAAAA TGAATGCAAT TGTTGTTGTT 
3121 AACTTGTTTA TTGCAGCTTA TAATGGTTAC AAATAAAGCA ATAGCATCAC AAATTTCACA 
3181 AATAAAGCAT TTTTTTCACT GCATTCTAGT TGTGGTTTGT CCAAACTCAT CAATGTATCT 
3241 TATCATGTCT GGATCGATCC TGCATTAATG AATCGGCCAA CGCGCGGGGA GAGGCGGTTT 
3301 GCGTATTGGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT CGCCCTTCCC AACAGTTGCG 
3361 CAGCCTGAAT GGCGAATGGG ACGCGCCCTG TAGCGGCGCA TTAAGCGCGG CGGGTGTGGT 
3421 GGTTACGCGC AGCGTGACCG CTACACTTGC CAGCGCCCTA GCGCCCGCTC CTTTCGCTTT 
3481 CTTCCCTTCC TTTCTCGCCA CGTTCGCCGG CTTTCCCCGT CAAGCTCTAA ATCGGGGGCT 
3541 CCCTTTAGGG TTCCGATTTA GTGCTTTACG GCACCTCGAC CCCAAAAAAC TTGATTAGGG 
3601 TGATGGTTCA CGTAGTGGGC CATCGCCCTG ATAGACGGTT TTTCGCCCTT TGACGTTGGA 
3661 GTCCACGTTC TTTAATAGTG GACTCTTGTT CCAAACTGGA ACAACACTCA ACCCTATCTC 
3721 GGTCTATTCT TTTGATTTAT AAGGGATTTT GCCGATTTCG GCCTATTGGT TAAAAAATGA 
3781 GCTGATTTAA CAAATATTTA ACGCGAATTT TAACAAAATA TTAACGTTTA CAATTTCGCC 
3841 TGATGCGGTA TTTTCTCCTT ACGCATCTGT GCGGTATTTC ACACCGCATA CGCGGATCTG 
3901 CGCAGCACCA TGGCCTGAAA TAACCTCTGA AAGAGGAACT TGGTTAGGTA CCTTCTGAGG 
3961 CGGAAAGAAC CAGCTGTGGA ATGTGTGTCA GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC 
4021 AGCAGGCAGA AGTATGCAAA GCATGCATCT CAATTAGTCA GCAACCAGGT GTGGAAAGTC 
4081 CCCAGGCTCC CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAT 
4141 AGTCCCGCCC CTAACTCCGC CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC 
4201 GCCCCATGGC TGACTAATTT TTTTTATTTA TGCAGAGGCC GAGGCCGCCT CGGCCTCTGA 
4261 GCTATTCCAG AAGTAGTGAG GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGA 
4321 TTCTTCTGAC ACAACAGTCT CGAACTTAAG GCTAGAGCCA CCATGATTGA ACAAGATGGA 
4381 TTGCACGCAG GTTCTCCGGC CGCTTGGGTG GAGAGGCTAT TCGGCTATGA CTGGGCACAA 
4441 CAGACAATCG GCTGCTCTGA TGCCGCCGTG TTCCGGCTGT CAGCGCAGGG GCGCCCGGTT 
4501 CTTTTTGTCA AGACCGACCT GTCCGGTGCC CTGAATGAAC TGCAGGACGA GGCAGCGCGG 
4561 CTATCGTGGC TGGCCACGAC GGGCGTTCCT TGCGCAGCTG TGCTCGACGT TGTCACTGAA 
4621 GCGGGAAGGG ACTGGCTGCT ATTGGGCGAA GTGCCGGGGC AGGATCTCCT GTCATCTCAC 
4681 CTTGCTCCTG CCGAGAAAGT ATCCATCATG GCTGATGCAA TGCGGCGGCT GCATACGCTT 
4741 GATCCGGCTA CCTGCCCATT CGACCACCAA GCGAAACATC GCATCGAGCG AGCACGTACT 
4801 CGGATGGAAG CCGGTCTTGT CGATCAGGAT GATCTGGACG AAGAGCATCA GGGGCTCGCG 
4861 CCAGCCGAAC TGTTCGCCAG GCTCAAGGCG CGCATGCCCG ACGGCGAGGA TCTCGTCGTG 
4921 ACCCATGGCG ATGCCTGCTT GCCGAATATC ATGGTGGAAA ATGGCCGCTT TTCTGGATTC 
4981 ATCGACTGTG GCCGGCTGGG TGTGGCGGAC CGCTATCAGG ACATAGCGTT GGCTACCCGT 
5041 GATATTGCTG AAGAGCTTGG CGGCGAATGG GCTGACCGCT TCCTCGTGCT TTACGGTATC 
5101 GCCGCTCCCG ATTCGCAGCG CATCGCCTTC TATCGCCTTC TTGACGAGTT CTTCTGAGCG 
5161 GGACTCTGGG GTTCGAAATG ACCGACCAAG CGACGCCCAA CCTGCCATCA CGATGGCCGC 
5221 AATAAAATAT CTTTATTTTC ATTACATCTG TGTGTTGGTT TTTTGTGTGA ATCGATAGCG 
5281 ATAAGGATCC GCGTATGGTG CACTCTCAGT ACAATCTGCT CTGATGCCGC ATAGTTAAGC 
5341 CAGCCCCGAC ACCCGCCAAC ACCCGCTGAC GCGCCCTGAC GGGCTTGTCT GCTCCCGGCA 
5401 TCCGCTTACA GACAAGCTGT GACCGTCTCC GGGAGCTGCA TGTGTCAGAG GTTTTCACCG 
5461 TCATCACCGA AACGCGCGAG ACGAAAGGGC CTCGTGATAC GCCTATTTTT ATAGGTTAAT 
5521 GTCATGATAA TAATGGTTTC TTAGACGTCA GGTGGCACTT TTCGGGGAAA TGTGCGCGGA 
5581 ACCCCTATTT GTTTATTTTT CTAAATACAT TCAAATATGT ATCCGCTCAT GAGACAATAA 
5641 CCCTGATAAA TGCTTCAATA ATATTGAAAA AGGAAGAGTA TGAGTATTCA ACATTTCCGT 
5701 GTCGCCCTTA TTCCCTTTTT TGCGGCATTT TGCCTTCCTG TTTTTGCTCA CCCAGAAACG 
5761 CTGGTGAAAG TAAAAGATGC TGAAGATCAG TTGGGTGCAC GAGTGGGTTA CATCGAACTG 
5821 GATCTCAACA GCGGTAAGAT CCTTGAGAGT TTTCGCCCCG AAGAACGTTT TCCAATGATG 
5881 AGCACTTTTA AAGTTCTGCT ATGTGGCGCG GTATTATCCC GTATTGACGC CGGGCAAGAG 
5941 CAACTCGGTC GCCGCATACA CTATTCTCAG AATGACTTGG TTGAGTACTC ACCAGTCACA 
6001 GAAAAGCATC TTACGGATGG CATGACAGTA AGAGAATTAT GCAGTGCTGC CATAACCATG 
6061 AGTGATAACA CTGCGGCCAA CTTACTTCTG ACAACGATCG GAGGACCGAA GGAGCTAACC 
6121 GCTTTTTTGC ACAACATGGG GGATCATGTA ACTCGCCTTG ATCGTTGGGA ACCGGAGCTG 
6181 AATGAAGCCA TACCAAACGA CGAGCGTGAC ACCACGATGC CTGTAGCAAT GGCAACAACG 
6241 TTGCGCAAAC TATTAACTGG CGAACTACTT ACTCTAGCTT CCCGGCAACA ATTAATAGAC 
6301 TGGATGGAGG CGGATAAAGT TGCAGGACCA CTTCTGCGCT CGGCCCTTCC GGCTGGCTGG 
6361 TTTATTGCTG ATAAATCTGG AGCCGGTGAG CGTGGGTCTC GCGGTATCAT TGCAGCACTG 
6421 GGGCCAGATG GTAAGCCCTC CCGTATCGTA GTTATCTACA CGACGGGGAG TCAGGCAACT 
6481 ATGGATGAAC GAAATAGACA GATCGCTGAG ATAGGTGCCT CACTGATTAA GCATTGGTAA 
6541 CTGTCAGACC AAGTTTACTC ATATATACTT TAGATTGATT TAAAACTTCA TTTTTAATTT 
6601 AAAAGGATCT AGGTGAAGAT CCTTTTTGAT AATCTCATGA CCAAAATCCC TTAACGTGAG 
6661 TTTTCGTTCC ACTGAGCGTC AGACCCCGTA GAAAAGATCA AAGGATCTTC TTGAGATCCT 
6721 TTTTTTCTGC GCGTAATCTG CTGCTTGCAA ACAAAAAAAC CACCGCTACC AGCGGTGGTT 
6781 TGTTTGCCGG ATCAAGAGCT ACCAACTCTT TTTCCGAAGG TAACTGGCTT CAGCAGAGCG 
6841 CAGATACCAA ATACTGTCCT TCTAGTGTAG CCGTAGTTAG GCCACCACTT CAAGAACTCT 
6901 GTAGCACCGC CTACATACCT CGCTCTGCTA ATCCTGTTAC CAGTGGCTGC TGCCAGTGGC 
6961 GATAAGTCGT GTCTTACCGG GTTGGACTCA AGACGATAGT TACCGGATAA GGCGCAGCGG 
7021 TCGGGCTGAA CGGGGGGTTC GTGCACACAG CCCAGCTTGG AGCGAACGAC CTACACCGAA 
7081 CTGAGATACC TACAGCGTGA GCATTGAGAA AGCGCCACGC TTCCCGAAGG GAGAAAGGCG 
7141 GACAGGTATC CGGTAAGCGG CAGGGTCGGA ACAGGAGAGC GCACGAGGGA GCTTCCAGGG 
7201 GGAAACGCCT GGTATCTTTA TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA 
7261 TTTTTGTGAT GCTCGTCA 
Figure 33A: pDEST13 
pDEST13 5848 bp 

Location (Base Nos.)    Gene Encoded 
599..1458               ampR 
4123..3998              attR1 
4372..5031              CmR 
5151..5235              inactivated ccdA 
5373..5678              ccdB 
5719..5843              attR2 



   1 TTCACTGGCC GTCGTTTTAC AACGTCGTGA CTGGGAAAAC CCTGGCGTTA CCCAACTTAA 
  61 TCGCCTTGCA GCACATCCCC CTTTCGCCAG CTGGCGTAAT AGCGAAGAGG CCCGCACCGA 
 121 TCGCCCTTCC CAACAGTTGC GCAGCCTGAA TGGCGAATGG CGCCTGATGC GGTATTTTCT 
 181 CGTTACGCAT CTGTGCGGTA TTTCACACCG CATATGGTGC ACTCTCAGTA CAATCTGCTC 
 241 TGATGCCGCA TAGTTAAGCC AGCCCCGACA CCCGCCAACA CCCGCTGACG CGCCCTGACG 
 301 GGCTTGTCTG CTCCCGGCAT CCGCTTACAG ACAAGCTGTG ACCGTCTCCG GGAGCTGCAT 
 361 GTGTCAGAGG TTTTCACCGT CATCACCGAA ACGCGCGAGA CGAAAGGGCC TCGTGATACG 
 421 CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGACGTCAG GTGGCACTTT 
 481 TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA 
 541 TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT 
 601 GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT 
 661 TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG 
 721 AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA 
 781 AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG 
 841 TATTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT 
 901 TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG 
 961 CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG 
1021 AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA 
1081 TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC 
1141 TGTAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC 
1201 CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC 
1261 GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG 
1321 CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC 
1381 GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC 
1441 ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT 
1501 AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC 
1561 CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA 
1621 AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC 
1681 ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT 
1741 AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTTCTT CTAGTGTAGC CGTAGTTAGG 
1801 CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC 
1861 AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT 
1921 ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA 
1981 GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CATTGAGAAA GCGCCACGCT 
2041 TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG 
2101 CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA 
2161 CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA 
2221 CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT 
2281 CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA 
2341 TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA 
2401 GCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA 
2461 CGACAGGTTT CCCGACTGGA AAGCGGGCAG TGAGCGCAAC GCAATTAATG TGAGTTAGCT 
2521 CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATGT TGTGTGGAAT 
2581 TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC CATGATTACG CCAAGCTTGG 
2641 CTGCAGGTGA TGATTATCAG CCAGCAGAGA TTAAGGAAAA CAGACAGGTT TATTGAGCGC 
2701 TTATCTTTCC CTTTATTTTT GCTGCGGTAA GTCGCATAAA AACCATTCTT CATAATTCAA 
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2 761 TCCATTTACT ATGTTATGTT CTGAGGGGAG TGAAAATTCC CCTAATTCGA TGAAGATTCT 
2821 TGCTCAATTG TTATCAGCTA TGCGCCGACC AG AAC AC C TT GCCGATCAGC CAAACGTCTC 
28 81 TTCAGGCCAC TGACTAGCGA TAACTTTCCC CACAACGGAA CAACTCTCAT TGCATGGGAT 

2 941 CATTGGGTAC TGTGGGTTTA GTGGTTGTAA AAACACCTGA CCGCTATCCC TGATCAGTTT 

3 0 01 CTTGAAGGTA AACTCATCAC CCCCAAGTCT GGCTATGCAG AAATCACCTG GCTCAACAGC 
3 061 CTGCTCAGGG TCAACGAGAA TTAACATTCC GTCAGGAAAG CTTGGCTTGG AGCCTGTTGG 
3121 TGCGGTCATG GAATTACCTT CAACCTCAAG CCAGAATGCA GAATCACTGG CTTTTTTGGT 
3181 TGTGCTTACC CATCTCTCCG CATCACCTTT GGTAAAGGTT CTAAG CTTAG GTGAGAACAT 
3 241 CCCTGCCTGA ACATGAGAAA AAACAGGGTA CTCATACTCA CTTCTAAGTG ACGGCTGCAT 
3 3 01 ACTAACCGCT TCATACATCT CGTAGATTTC TCTGGCGATT GAAGGGCTAA ATTCTTCAAC 

33 61 GCTAACTTTG AGAATTTTTG CAAGCAATGC GGCGTTATAA GCATTTAATG CATTGATGCC 
3 421 ATTAAATAAA GCACCAACGC CTGACTGCCC CATCCCCATC TTGTCTGCGA C AG ATT C C TG 

34 81 GGATAAGCCA AGTTCATTTT TCTTTTTTTC ATAAATTGCT TTAAGGCGAC GTGCGTCCTC 
3 541 AAGCTGCTCT TGTGTTAATG GTTTCTTTTT TGTGCTCATA CGTTAAATCT ATCACCGCAA 
3 6 01 GGGATAAATA TCTAACACCG TGCGTGTTGA CTATTTTACC TCTGGCGGTG ATAATGGTTG 
3 661 CATGTACTAA GGAGGTTGTA T GG AAC AAC G CATAACCCTG AAAGATTATG CAATGCGCTT 
3 721 TGGGCAAACC AAGACAGCTA AAGATCTCTC ACCTAC CAAA CAATGCCCCC CTGCAAAAAA 
3 781 TAAATTCATA TAAAAAACAT AC AGATAAC C ATCTGCGGTG ATAAATTATC TCTGGCGGTG 
3 841 TTGACATAAA TACCACTGGC GGTGATACTG AG C AC AT C AG CAGGACGCAC TG AC C AC CAT 
3 901 G AAGGTG AC G CTCTTAAAAA TTAAGCCCTG AAGAAGGG C A GCATTCAAAG CAGAAGGCTT 

3 961 TGGGGTGTGT GATACGAAAC GAAGCATTGG GATCATCACA AGTTTGTACA AAAAAG C TG A 

4 021 ACGAGAAACG TAAAATGATA TAAATATCAA TATATTAAAT TAGATTTTGC ATAAAAAACA 
4 081 GACTACATAA TACTGTAAAA CACAACATAT C CAGTC ACTA TGGCGGCCGC TAAGTTGGCA 
4141 GCATCACCCG ACGCACTTTG CGCCGAATAA ATACCTGTGA CGGAAGATCA CTTCGCAGAA 
42 01 TAAATAAATC CTGGTGTCCC TGTTGATACC GGGAAGCCCT GGGCCAACTT TTGG CGAAAA 
42 61 TGAGACGTTG ATCGGCACGT AAGAGGTTCC AACTTTCACC ATAATGAAAT AAGATCACTA 
4 321 CCGGGCGTAT TTTTTGAGTT ATCGAGATTT TCAGGAGCTA AGGAAGCTAA AATGGAGAAA 
4381 AAAATCACTG GATATACCAC CGTTGATATA TCCCAATGGC ATCGTAAAGA ACATTTTGAG 
4441 GCATTTCAGT CAGTTGCTCA ATGTAC CTAT AAC C AG AC CG TTCAGCTGGA TATTACGGCC 

45 01 TTTTTAAAGA CCGTAAAGAA AAATAAGCAC AAGTTTTATC CGGCCTTTAT TCACATTCTT 
4 561 GCCCGCCTGA TGAATGCTCA TCCGGAATTC CGTATGGCAA TGAAAGACGG TGAGCTGGTG 
4 621 ATATGGGATA GTGTTCACCC TTGTTACACC GTTTTCCATG AG C AAACTG A AACGTTTTCA 

46 81 TCGCTCTGGA GTGAATACCA CGACGATTTC CGGCAGTTTC TACACATATA TTCGCAAGAT 
4 741 GTGGCGTGTT ACGGTGAAAA CCTGGCCTAT TTCCCTAAAG GGTTTATTGA GAATATGTTT 
4 8 01 TTCGTCTCAG CCAATCCCTG GGTGAGTTTC AC CAGTTTTG ATTTAAACGT GGCCAATATG 
4 861 GACAACTTCT TCGCCCCCGT TTTCACCATG GG C AAATATT ATACGCAAGG CGACAAGGTG 
4 921 CTGATGCCGC TGGCGATTCA GGTTCATCAT GCCGTCTGTG ATGGCTTCCA TGTCGGCAGA 

4 981 ATGCTTAATG AATTACAACA GTACTGCGAT GAGTGGCAGG GCGGGGCGTA AACG CGTGGA 

5041 TCCGGCTTAC TAAAAGCCAG ATAACAGTAT GCGTATTTGC GCGCTGATTT TTGCGGTATA 
5101 AGAATATATA CTGATATGTA TACCCGAAGT ATGTCAAAAA GAGGTGTGCT ATGAAGCAGC 
5161 GTATTACAGT G AC AGTT G AC AGCGACAGCT ATCAGTTGCT CAAGGCATAT ATGATGTCAA 
5221 TATCTCCGGT CTGGTAAGCA C AAC C ATGCA GAATGAAGCC CGTCGTCTGC GTGCCGAACG 
52 81 CTGGAAAGCG GAAAATCAGG AAGGGATGGC TGAGGTCGCC CGGTTTATTG AAATGAACGG 
5 341 CTCTTTTGCT GACGAGAACA GGGACTGGTG AAATGCAGTT TAAGGTTTAC AC CTATAAAA 
54 01 GAGAGAGCCG TTATCGTCTG TTTGTGGATG TACAGAGTGA TATTATTGAC ACGCCCGGGC 
5461 GACGGATGGT GATCCCCCTG GCCAGTGCAC GTCTGCTGTC AGATAAAGTC TCCCGTGAAC 
5 521 TTTACCCGGT GGTGCATATC GGGGATGAAA GCTGGCGCAT GATGACCACC GATATGGCCA 
5 581 GTGTGCCGGT CTCCGTTATC GGGGAAGAAG TGGCTGATCT CAGCCACCGC GAAAATGACA 
5641 TCAAAAACGC CATTAACCTG ATGTTCTGGG GAATATAAAT GTCAGGCTCC GTTATACACA 
57 01 GCCAGTCTGC AGGTCGACCA TAGTGACTGG ATATGTTGTG TTTTACAGTA TTATGTAGTC 
57 61 TGTTTTTTAT GCAAAATCTA ATTTAATATA TTGATATTTA TATCATTTTA CGTTTCTCGT 
5821 TCAGCTTTCT TGTACAAAGT GGTGATAA 
Figure 3 pDEST14 Native Protein Expression in E. coli, T7 
pDEST14 6422 bp (rotated to position 4000) 



Location (Base Nos.)    Gene Encoded 
185..61                 attR1 
435..1094               CmR 
1214..1298              inactivated ccdA 
1436..1741              ccdB 
1782..1906              attR2 
2632..3489              ampR 



1 CGATCCCGCG AAATTAATAC GACTCACTAT AGGGAGACCA CAACGGTTTC CCTCTAGATC 
61 ACAAGTTTGT ACAAAAAAGC TGAACGAGAA ACGTAAAATG ATATAAATAT CAATATATTA 
121 AATTAGATTT TGCATAAAAA ACAGACTACA TAATACTGTA AAACACAACA TATCCAGTCA 
181 CTATGGCGGC CGCTAAGTTG GCAGCATCAC CCGACGCACT TTGCGCCGAA TAAATACCTG 
241 TGACGGAAGA TCACTTCGCA GAATAAATAA ATCCTGGTGT CCCTGTTGAT ACCGGGAAGC 
301 CCTGGGCCAA GTTTTGGCGA AAATGAGACG TTGATCGGCA CGTAAGAGGT TCCAACTTTC 
361 ACCATAATGA AATAAGATCA CTACCGGGCG TATTTTTTGA GTTATCGAGA TTTTCAGGAG 
421 CTAAGGAAGC TAAAATGGAG AAAAAAATCA CTGGATATAC CACCGTTGAT ATATCCCAAT 
481 GGCATCGTAA AGAACATTTT GAGGCATTTC AGTCAGTTGC TCAATGTACC TATAACCAGA 
541 CCGTTCAGCT GGATATTACG GCCTTTTTAA AGACCGTAAA GAAAAATAAG CACAAGTTTT 
601 ATCCGGCCTT TATTCACATT CTTGCCCGCC TGATGAATGC TCATCCGGAA TTCCGTATGG 
661 CAATGAAAGA CGGTGAGCTG GTGATATGGG ATAGTGTTCA CCCTTGTTAC ACCGTTTTCC 
721 ATGAGCAAAC TGAAACGTTT TCATCGCTCT GGAGTGAATA CCACGACGAT TTCCGGCAGT 
781 TTCTACACAT ATATTCGCAA GATGTGGCGT GTTACGGTGA AAACCTGGCC TATTTCCCTA 
841 AAGGGTTTAT TGAGAATATG TTTTTCGTCT CAGCCAATCC CTGGGTGAGT TTCACCAGTT 
901 TTGATTTAAA CGTGGCCAAT ATGGACAACT TCTTCGCCCC CGTTTTCACC ATGGGCAAAT 
961 ATTATACGCA AGGCGACAAG GTGCTGATGC CGCTGGCGAT TCAGGTTCAT CATGCCGTCT 
1021 GTGATGGCTT CCATGTCGGC AGAATGCTTA ATGAATTACA ACAGTACTGC GATGAGTGGC 
1081 AGGGCGGGGC GTAAACGCGT GGATCCGGCT TACTAAAAGC CAGATAACAG TATGCGTATT 
1141 TGCGCGCTGA TTTTTGCGGT ATAAGAATAT ATACTGATAT GTATACCCGA AGTATGTCAA 
1201 AAAGAGGTGT GCTATGAAGC AGCGTATTAC AGTGACAGTT GACAGCGACA GCTATCAGTT 
1261 GCTCAAGGCA TATATGATGT CAATATCTCC GGTCTGGTAA GCACAACCAT GCAGAATGAA 
1321 GCCCGTCGTC TGCGTGCCGA ACGCTGGAAA GCGGAAAATC AGGAAGGGAT GGCTGAGGTC 
1381 GCCCGGTTTA TTGAAATGAA CGGCTCTTTT GCTGACGAGA ACAGGGACTG GTGAAATGCA 
1441 GTTTAAGGTT TACACCTATA AAAGAGAGAG CCGTTATCGT CTGTTTGTGG ATGTACAGAG 
1501 TGATATTATT GACACGCCCG GGCGACGGAT GGTGATCCCC CTGGCCAGTG CACGTCTGCT 
1561 GTCAGATAAA GTCTCCCGTG AACTTTACCC GGTGGTGCAT ATCGGGGATG AAAGCTGGCG 
1621 CATGATGACC ACCGATATGG CCAGTGTGCC GGTCTCCGTT ATCGGGGAAG AAGTGGCTGA 
1681 TCTCAGCCAC CGCGAAAATG ACATCAAAAA CGCCATTAAC CTGATGTTCT GGGGAATATA 
1741 AATGTCAGGC TCCCTTATAC ACAGCCAGTC TGCAGGTCGA CCATAGTGAC TGGATATGTT 
1801 GTGTTTTACA GTATTATGTA GTCTGTTTTT TATGCAAAAT CTAATTTAAT ATATTGATAT 
1861 TTATATCATT TTACGTTTCT CGTTCAGCTT TCTTGTACAA AGTGGTGATG ATCCGGCTGC 
1921 TAACAAAGCC CGAAAGGAAG CTGAGTTGGC TGCTGCCACC GCTGAGCAAT AACTAGCATA 
1981 ACCCCTTGGG GCCTCTAAAC GGGTCTTGAG GGGTTTTTTG CTGAAAGGAG GAACTATATC 
2041 CGGATATCCA CAGGACGGGT GTGGTCGCCA TGATCGCGTA GTCGATAGTG GCTCCAAGTA 
2101 GCGAAGCGAG CAGGACTGGG CGGCGGCCAA AGCGGTCGGA CAGTGCTCCG AGAACGGGTG 
2161 CGCATAGAAA TTGCATCAAC GCATATAGCG CTAGCAGCAC GCCATAGTGA CTGGCGATGC 
2221 TGTCGGAATG GACGATATCC CGCAAGAGGC CCGGCAGTAC CGGCATAACC AAGCCTATGC 
2281 CTACAGCATC CAGGGTGACG GTGCCGAGGA TGACGATGAG CGCATTGTTA GATTTCATAC 
2341 ACGGTGCCTG ACTGCGTTAG CAATTTAACT GTGATAAACT ACCGCATTAA AGCTTATCGA 
2401 TGATAAGCTG TCAAACATGA GAATTCTTGA AGACGAAAGG GCCTCGTGAT ACGCCTATTT 
2461 TTATAGGTTA ATGTCATGAT AATAATGGTT TCTTAGACGT CAGGTGGCAC TTTTCGGGGA 
2521 AATGTGCGCG GAACCCCTAT TTGTTTATTT TTCTAAATAC ATTCAAATAT GTATCCGCTC 
2581 ATGAGACAAT AACCCTGATA AATGCTTCAA TAATATTGAA AAAGGAAGAG TATGAGTATT 
2641 CAACATTTCC GTGTCGCCCT TATTCCCTTT TTTGCGGCAT TTTGCCTTCC TGTTTTTGCT 
2701 CACCCAGAAA CGCTGGTGAA AGTAAAAGAT GCTGAAGATC AGTTGGGTGC ACGAGTGGGT 
2761 TACATCGAAC TGGATCTCAA CAGCGGTAAG ATCCTTGAGA GTTTTCGCCC CGAAGAACGT 
2 821 TTTCCAATGA TGAGCACTTT TAAAGTTCTG CTATGTGGCG CGGTATTATC CCGTGTTGAC 

28 81 GCCGGGCAAG AG CAACTCGG TCGCCGCATA CACTATTCTC AGAATGACTT GGTTGAGTAC 

2 941 TCACCAGTCA CAGAAAAGCA TCTTACGGAT GGCATGACAG TAAGAGAATT ATGCAGTGCT 

3 0 01 GCCATAACCA TGAGTGATAA CACTGCGGCC AACTTACTTC TGACAACGAT CGGAGGACCG 
3 061 AAGGAGCTAA CCGCTTTTTT GCACAACATG GGGGATCATG TAACTCGCCT TGATCGTTGG 
3121 GAACCGGAGC TGAATGAAGC CATACCAAAC GACGAGCGTG ACACCACGAT GCCTGCAGCA 
3181 ATGGCAACAA CGTTGCGCAA ACTATTAACT GGCGAACTAC TTACTCTAGC TTCCCGGCAA 
3 241 CAATTAATAG ACTGGATGGA GGCGGATAAA GTTGCAGGAC CACTTCTGCG CTCGGCCCTT 
3 3 01 CCGGCTGGCT GGTTTATTGC TGATAAATCT GGAGCCGGTG AGCGTGGGTC TCGCGGTATC 
3 3 61 ATTGCAGCAC TGGGGCCAGA TGGTAAGCCC TCCCGTATCG TAGTTATCTA CACGACGGGG 
3421 AGTCAGG CAA CTATGGATGA AC GAAATAG A CAGATCGCTG AGATAGGTGC CTCACTGATT 
34 81 AAGCATTGGT AACTGTCAGA CCAAGTTTAC TCATATATAC TTTAGATTGA TTTAAAACTT 
3 541 CATTTTTAAT TTAAAAGGAT CTAGGTGAAG ATCCTTTTTG ATAATCTCAT GACCAAAATC 
36 01 CCTTAACGTG AGTTTTCGTT CCACTGAGCG TCAGACCCCG TAGAAAAGAT CAAAGGATCT 
3 661 TCTTGAGATC CTTTTTTTCT GCGCGTAATC TGCTGCTTGC AAACAAAAAA ACCACCGCTA 
3 721 CCAGCGGTGG TTTGTTTGCC GGATCAAGAG CTACCAACTC TTTTTCCGAA GGTAACTGGC 
3 781 TTCAGCAGAG CGCAGATACC AAATACTGTC CTTCTAGTGT AG C CGTAGTT AGGCCACCAC 
3 841 TTCAAGAACT CTGTAGCACC GCCTACATAC CTCGCTCTGC TAATCCTGTT ACCAGTGGCT 
3 901 GCTGCCAGTG GCGATAAGTC GTGTCTTACC GGGTTGGACT CAAGACGATA GTTACCGGAT 

3 961 AAGGCGCAGC GGTCGGGCTG AACGGGGGGT TCGTGCACAC AGCCCAGCTT GGAGCGAACG 

4 021 ACCTACACCG AACTGAGATA CCTACAGCGT GAGCTATGAG AAAGCGCCAC GCTTCCCGAA 
4 081 GGGAGAAAGG CGGACAGGTA TCCGGTAAGC GGCAGGGTCG GAACAGGAGA GCGCACGAGG 
4141 GAGCTTCCAG GGGGAAACGC CTGGTATCTT TATAGTC CTG TCGGGTTTCG CCACCTCTGA 
42 01 CTTGAGCGTC GATTTTTGTG ATGCTCGTCA GGGGGGCGGA GCCTATGGAA AAACGCCAGC 
42 61 AACGCGGCCT TTTTACGGTT CCTGGCCTTT TGCTGGCCTT TTGCTCACAT GTTCTTTCCT 
4 321 GCGTTATCCC CTGATTCTGT GGATAACCGT ATTACCGCCT TTGAGTGAGC TGATACCGCT 
4 381 CGCCGCAGCC GAACGACCGA GCGCAGCGAG TCAGTGAGCG AGGAAGCGGA AGAGCGCCTG 
4 441 ATGCGGTATT TTCTCCTTAC GCATCTGTGC GGTATTTCAC AC CG CATATA TGGTG CACTC 
4 501 TCAGTACAAT CTGCTCTGAT GCCGCATAGT TAAGCCAGTA TACACTCCGC TATCGCTACG 
4 561 TGACTGGGTC ATGGCTGCGC CCCGACACCC GCCAACACCC GCTGACGCGC CCTGACGGGC 
4 621 TTGTCTGCTC CCGGCATCCG C TTAC AG AC A AGCTGTGACC GTCTCCGGGA GCTGCATGTG 
4 681 TCAGAGGTTT TCACCGTCAT CACCGAAACG CGCGAGGCAG CTGCGGTAAA GCTCATCAGC 
4 741 GTGGTCGTGA AGCGATTCAC AGATGTCTGC CTGTTCATCC GCGTCCAGCT CGTTGAGTTT 
48 01 CTCCAGAAGC GTTAATGTCT GGCTTCTGAT AAAGCGGGCC ATGTTAAGGG CGGTTTTTTC 
4861 CTGTTTGGTC ACTGATGCCT CCGTGTAAGG GGGATTTCTG TTCATGGGGG TAATGATACC 
4 921 GATGAAACGA GAGAGGATGC TCACGATACG GGTTACTGAT GATGAACATG CCCGGTTACT 

4 981 GGAACGTTGT GAGGGTAAAC AACTGGCGGT ATGGATGCGG CGGGACCAGA G AAAAAT C AC 

5 041 TCAGGGTCAA TGCCAGCGCT TCGTTAATAC AGATGTAGGT GTTCCACAGG GTAGCCAGCA 
5101 GCATCCTGCG ATGCAGATCC GGAACATAAT GGTGCAGGGC GCTGACTTCC GCGTTTCCAG 
5161 ACTTTACGAA ACACGGAAAC C G AAG AC CAT TCATGTTGTT GCTCAGGTCG CAGACGTTTT 
52 21 GCAGCAGCAG TCGCTTCACG TTCGCTCGCG TATCGGTGAT TCATTCTGCT AACCAGTAAG 
52 81 GCAACCCCGC CAGCCTAGCC GGGTCCTCAA CGACAGGAGC ACGATCATGC GCACCCGTGG 
5341 CCAGGACCCA ACGCTGCCCG AGATGCGCCG CGTGCGGCTG CTGGAGATGG CGGACGCGAT 
54 01 GGATATGTTC TGCCAAGGGT TGGTTTGCGC ATTCACAGTT CTCCGCAAGA ATTGATTGGC 

54 61 TCCAATTCTT GGAGTGGTGA ATCCGTTAGC GAGGTGCCGC CGGCTTCCAT TCAGGTCGAG 
5 521 GTGGCCCGGC TCCATGCACC GCGACGCAAC GCGGGGAGGC AGACAAGGTA TAGGGCGGCG 

55 81 CCTACAATCC ATGCCAACCC GTTCCATGTG CTCGCCGAGG C GG CAT AAAT CGCCGTGACG 
5641 ATCAGCGGTC CAGTGATCGA AGTTAGGCTG GTAAGAGCCG CGAGCGATCC TTGAAGCTGT 
57 01 CCCTGATGGT CGTCATCTAC CTGCCTGGAC AGCATGGCCT GCAACGCGGG CATCCCGATG 

57 61 CCGCCGGAAG CGAGAAGAAT CATAATGGGG AAGGCCATCC AGCCTCGCGT CGCGAACGCC 
5 821 AGCAAGACGT AGCCCAGCGC GTCGGCCGCC ATGCCGGCGA TAATGGCCTG CTTCTCGCCG 

58 81 AAACGTTTGG TGGCGGGACC AGTGACGAAG GCTTGAGCGA GGGCGTGCAA GATTCCGAAT 

5 941 ACCGCAAGCG ACAGGCCGAT CATCGTCGCG CTCCAGCGAA AGCGGTCCTC GCCGAAAATG 

6 0 01 AC C C AG AG C G CTGCCGGCAC CTGTCCTACG AGTTGCATGA TAAAGAAGAC AG T CAT AAG T 
6 061 GCGGCGACGA TAGTCATGCC CCGCGCCCAC CGGAAGGAGC TGACTGGGTT GAAGGCTCTC 
6121 AAGGGCATCG GTCGATCGAC GCTCTCCCTT ATGCGACTCC TGCATTAGGA AGCAGCCCAG 
6181 TAGTAGGTTG AGGCCGTTGA GCACCGCCGC CGCAAGGAAT GGTGCATGCA AGGAGATGGC 
6241 GCCCAACAGT CCCCCGGCCA CGGGGCCTGC CACCATACCC ACGCCGAAAC AAGCGCTCAT 

63 01 GAGCCCGAAG TGGCGAGCCC GATCTTCCCC ATCGGTGATG TCGGCGATAT AGGCGCCAGC 

6361 AACCGCACCT GTGGCGCCGG TGATGCCGGC CACGATGCGT CCGGCGTAGA GGATCGAGAT 

6421 CT 
Figure 3S7^ pDEST15 Glutathione-S-transferase Fusion in E. coli, T7 Promoter 
pDEST15 7013 bp 



Location (Base Nos.)    Gene Encoded 
108..776                GST 
916..792                attR1 
1025..1537              CmR 
1804..1888              inactivated ccdA 
2026..2331              ccdB 
2372..2496              attR2 
3233..4093              ampR 



1 ATCGAGATCT CGATCCCGCG AAATTAATAC GACTCACTAT AGGGAGACCA CAACGGTTTC 
61 CCTCTAGAAA TAATTTTGTT TAACTTTAAG AAGGAGATAT ACATATGTCC CCTATACTAG 
121 GTTATTGGAA AATTAAGGGC CTTGTGCAAC CCACTCGACT TCTTTTGGAA TATCTTGAAG 
181 AAAAATATGA AGAGCATTTG TATGAGCGCG ATGAAGGTGA TAAATGGCGA AACAAAAAGT 
241 TTGAATTGGG TTTGGAGTTT CCCAATCTTC CTTATTATAT TGATGGTGAT GTTAAATTAA 
301 CACAGTCTAT GGCCATCATA CGTTATATAG CTGACAAGCA CAACATGTTG GGTGGTTGTC 
361 CAAAAGAGCG TGCAGAGATT TCAATGCTTG AAGGAGCGGT TTTGGATATT AGATACGGTG 
421 TTTCGAGAAT TGCATATAGT AAAGACTTTG AAACTCTCAA AGTTGATTTT CTTAGCAAGC 
481 TACCTGAAAT GCTGAAAATG TTCGAAGATC GTTTATGTCA TAAAACATAT TTAAATGGTG 
541 ATCATGTAAC CCATCCTGAC TTCATGTTGT ATGACGCTCT TGATGTTGTT TTATACATGG 
601 ACCCAATGTG CCTGGATGCG TTCCCAAAAT TAGTTTGTTT TAAAAAACGT ATTGAAGCTA 
661 TCCCACAAAT TGATAAGTAC TTGAAATCCA GCAAGTATAT AGCATGGCCT TTGCAGGGCT 
721 GGCAAGCCAC GTTTGGTGGT GGCGACCATC CTCCAAAATC GGATCTGGTT CCGCGTCCAT 
781 GGTCGAATCA AACAAGTTTG TACAAAAAAG CTGAACGAGA AACGTAAAAT GATATAAATA 
841 TCAATATATT AAATTAGATT TTGCATAAAA AACAGACTAC ATAATACTGT AAAACACAAC 
901 ATATCCAGTC ACTATGGCGG CCGCATTAGG CACCCCAGGC TTTACACTTT ATGCTTCCGG 
961 CTCGTATAAT GTGTGGATTT TGAGTTAGGA TCCGTCGAGA TTTTCAGGAG CTAAGGAAGC 
1021 TAAAATGGAG AAAAAAATCA CTGGATATAC CACCGTTGAT ATATCCCAAT GGCATCGTAA 
1081 AGAACATTTT GAGGCATTTC AGTCAGTTGC TCAATGTACC TATAACCAGA CCGTTCAGCT 
1141 GGATATTACG GCCTTTTTAA AGACCGTAAA GAAAAATAAG CACAAGTTTT ATCCGGCCTT 
1201 TATTCACATT CTTGCCCGCC TGATGAATGC TCATCCGGAA TTCCGTATGG CAATGAAAGA 
1261 CGGTGAGCTG GTGATATGGG ATAGTGTTCA CCCTTGTTAC ACCGTTTTCC ATGAGCAAAC 
1321 TGAAACGTTT TCATCGCTCT GGAGTGAATA CCACGACGAT TTCCGGCAGT TTCTACACAT 
1381 ATATTCGCAA GATGTGGCGT GTTACGGTGA AAACCTGGCC TATTTCCCTA AAGGGTTTAT 
1441 TGAGAATATG TTTTTCGTCT CAGCCAATCC CTGGGTGAGT TTCACCAGTT TTGATTTAAA 
1501 CGTGGCCAAT ATGGACAACT TCTTCGCCCC CGTTTTCACC ATGGGCAAAT ATTATACGCA 
1561 AGGCGACAAG GTGCTGATGC CGCTGGCGAT TCAGGTTCAT CATGCCGTCT GTGATGGCTT 
1621 CCATGTCGGC AGAATGCTTA ATGAATTACA ACAGTACTGC GATGAGTGGC AGGGCGGGGC 
1681 GTAATCTAGA GGATCCGGCT TACTAAAAGC CAGATAACAG TATGCGTATT TGCGCGCTGA 
1741 TTTTTGCGGT ATAAGAATAT ATACTGATAT GTATACCCGA AGTATGTCAA AAAGAGGTGT 
1801 GCTATGAAGC AGCGTATTAC AGTGACAGTT GACAGCGACA GCTATCAGTT GCTCAAGGCA 
1861 TATATGATGT CAATATCTCC GGTCTGGTAA GCACAACCAT GCAGAATGAA GCCCGTCGTC 
1921 TGCGTGCCGA ACGCTGGAAA GCGGAAAATC AGGAAGGGAT GGCTGAGGTC GCCCGGTTTA 
1981 TTGAAATGAA CGGCTCTTTT GCTGACGAGA ACAGGGACTG GTGAAATGCA GTTTAAGGTT 
2041 TACACCTATA AAAGAGAGAG CCGTTATCGT CTGTTTGTGG ATGTACAGAG TGATATTATT 
2101 GACACGCCCG GGCGACGGAT GGTGATCCCC CTGGCCAGTG CACGTCTGCT GTCAGATAAA 
2161 GTCTCCCGTG AACTTTACCC GGTGGTGCAT ATCGGGGATG AAAGCTGGCG CATGATGACC 
2221 ACCGATATGG CCAGTGTGCC GGTCTCCGTT ATCGGGGAAG AAGTGGCTGA TCTCAGCCAC 
2281 CGCGAAAATG ACATCAAAAA CGCCATTAAC CTGATGTTCT GGGGAATATA AATGTCAGGC 
2341 TCCCTTATAC ACAGCCAGTC TGCAGGTCGA CCATAGTGAC TGGATATGTT GTGTTTTACA 
2401 GTATTATGTA GTCTGTTTTT TATGCAAAAT CTAATTTAAT ATATTGATAT TTATATCATT 
2461 TTACGTTTCT CGTTCAGCTT TCTTGTACAA AGTGGTTTGA TTCGACCCGG GATCCGGCTG 
2521 CTAACAAAGC CCGAAAGGAA GGTGAGTTGG CTGCTGCCAC CGCTGAGCAA TAACTAGCAT 
2581 AACCCCTTGG GGCCTCTAAA CGGGTCTTGA GGGGTTTTTT GCTGAAAGGA GGAACTATAT 
2641 CCGGATATCC ACAGGACGGG TGTGGTCGCC ATGATCGCGT AGTCGATAGT GGCTCCAAGT 
2701 AGCGAAGCGA GCAGGACTGG GCGGCGGCCA AAGCGGTCGG ACAGTGCTCC GAGAACGGGT 
2 761 GCGCATAGAA ATTGCATCAA CGCATATAGC GCTAGCAGCA CGCCATAGTG ACTGGCGATG 
2 821 CTGTCGGAAT G G AC GAT AT C CCGCAAGAGG CCCGGCAGTA CCGGCATAAC CAAGCCTATG 
2 881 CCTACAGCAT CCAGGGTGAC GGTGCCGAGG ATGACGATGA GCGCATTGTT AGATTTCATA 

2 941 CACGGTGCCT GACTGCGTTA GCAATTTAAC TGTGATAAAC T AC C G C ATT A AAGCTTATCG 

3 001 ATGATAAGCT GTCAAACATG AGAATTCTTG AAGACGAAAG GGCCTCGTGA TACGCCTATT 
3 061 TTTATAGGTT AATGTCATGA TAATAATGGT TTCTTAGACG TCAGGTGGCA CTTTTCGGGG 
3121 AAATGTGCGC GGAACCCCTA TTTGTTTATT TTTCTAAATA CATTCAAATA TGTATCCGCT 
3181 CATGAGACAA TAACCCTGAT AAATGCTTCA ATAATATTGA AAAAGGAAGA GTATGAGTAT 
3241 TCAACATTTC CGTGTCGCCC TTATTCCCTT TTTTGCGGCA TTTTGCCTTC CTGTTTTTGC 

33 01 TCACCCAGAA ACGCTGGTGA AAGTAAAAGA TGCTGAAGAT CAGTTGGGTG CACGAGTGGG 
3 361 TTACATCGAA CTGGATCTCA ACAGCGGTAA GATCCTTGAG AGTTTTCGCC CCGAAGAACG 
3421 TTTTCCAATG ATGAGCACTT TTAAAGTTCT GCTATGTGGC GCGGTATTAT CCCGTGTTGA 

34 81 CGCCGGGCAA GAGCAACTCG GTCGCCGCAT ACACTATTCT CAGAATGACT TGGTTGAGTA 
3 541 CTCACCAGTC AC AG AAAAG C ATCTTACGGA TGGCATGACA GTAAGAGAAT TATGCAGTGC 
3 6 01 TGCCATAACC ATGAGTGATA ACACTGCGGC CAACTTACTT CTGACAACGA TCGGAGGACC 
3 661 GAAGGAGCTA ACCGCTTTTT TG C AC AAC AT GGGGGATCAT GTAACTCGCC TTGATCGTTG 
3 721 GGAACCGGAG CTGAATGAAG C CAT AC C AAA CGACGAGCGT G AC AC C ACG A TGCCTGCAGC 
3 781 AATGGCAACA ACGTTGCGCA AACTATTAAC TGGCGAACTA CTTACTCTAG CTTCCCGGCA 
3 841 ACAATTAATA GACTGGATGG AGGCGGATAA AGTTGCAGGA CCACTTCTGC GCTCGGCCCT 
3 901 TCCGGCTGGC TGGTTTATTG CTGATAAATC TGGAGCCGGT GAGCGTGGGT CTCG CGGTAT 

3 961 CATTGCAGCA CTGGGGCCAG ATGGTAAGCC CTCCCGTATC GTAGTTATCT ACACGACGGG 

4 021 GAGTCAGGCA ACTATGGATG AACGAAATAG ACAGATCGCT GAGATAGGTG CCTCACTGAT 
4 081 TAAGCATTGG TAACTGTCAG ACCAAGTTTA C T CAT AT AT A CTTTAGATTG ATTTAAAACT 
4141 TCATTTTTAA TTTAAAAGGA TCTAGGTGAA GATCCTTTTT GATAATCTCA TG AC C AAAAT 
42 01 CCCTTAACGT GAGTTTTCGT TCCACTGAGC GTCAGACCCC G TAG AAAAG A TCAAAGGATC 
42 61 TTCTTGAGAT CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT 
4 321 AC C AG C GGTG GTTTGTTTGC CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG 
4 381 CTTCAGCAGA GCGCAGATAC CAAATACTGT CCTTCTAGTG TAGCCGTAGT TAGGCCACCA 
4441 CTTCAAGAAC TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC 

45 01 TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTAC CGGA 
4 561 TAAGGCGCAG CGGTCGGGCT GAACGG GGGG TTCGTG CACA CAGCCCAGCT TGGAGCGAAC 
4 621 GACCTACACC GAACTGAGAT ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA 

46 81 AGGGAGAAAG GCGGACAGGT ATCCGGTAAG CGGCAGGGTC G G AAC AGG AG AGCGCACGAG 
4 741 GGAGCTTCCA GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG 
48 01 ACTTGAGCGT CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG 
4861 CAACGCGGCC TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC 
4 921 TGCGTTATCC CCTGATTCTG TGGATAACCG TATTACCGCC TTTGAGTGAG CTGATACCGC 

4 981 TCGCCGCAGC CGAACGACCG AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGAGCGCCT 

5 041 GATG CGGTAT TTTCTCCTTA CGCATCTGTG CGGTATTTCA CACCGCATAT ATGGTGCACT 
5101 CTCAGTACAA TCTGCTCTGA TG C CG CAT AG TTAAGCCAGT ATACACTCCG CTATCGCTAC 
5161 GTGACTGGGT CATGGCTGCG CCCCGACACC CGCCAACACC CGCTGACGCG CCCTGACGGG 
5221 CTTGTCTGCT CCCGGCATCC G CTTAC AG AC AAGCTGTGAC CGTCTCCGGG AGCTGCATGT 
52 81 GTCAGAGGTT TTCACCGTCA TCACCGAAAC GCGCGAGGCA GCTGCGGTAA AGCTCATCAG 
5 341 CGTGGTCGTG AAGCGATTCA CAGATGTCTG CCTGTTCATC CGCGTCCAGC TCGTTGAGTT 
54 01 TCTCCAGAAG CGTTAATGTC TGGCTTCTGA TAAAGCGGGC CATGTTAAGG GCGGTTTTTT 
54 61 CCTGTTTGGT CACTGATGCC TCCGTGTAAG GGGGATTTCT GTTCATGGGG GTAATGATAC 
5521 CGATGAAACG AGAGAGGATG CTCACGATAC GGGTTACTGA TGATGAACAT GCCCGGTTAC 
5581 TGGAACGTTG TGAGGGTAAA CAACTGGCGG TATGGATGCG GCGGGACCAG AGAAAAATCA 

56 41 CTCAGGGTCA ATGCCAGCGC TTCGTTAATA CAGATGTAGG TGTTCCACAG GGTAGCCAGC 

57 01 AGCATCCTGC GATGCAGATC CGGAACATAA TGGTGCAGGG CGCTGACTTC CGCGTTTCCA 

57 61 GACTTTACGA AACACGGAAA CCGAAGACCA TTCATGTTGT TGCTCAGGTC GCAGACGTTT 
5821 TG C AG C AG C A GTCGCTTCAC GTTCGCTCGC GTATCGGTGA TTCATTCTGC T AAC C AGTAA 

58 81 GGCAACCCCG CCAGCCTAGC CGGGTCCTCA ACGACAGGAG CACGATCATG CGCACCCGTG 

5 941 GCCAGGACCC AACGCTGCCC GAGATGCGCC GCGTGCGGCT GCTGGAGATG GCGGACGCGA 

6 0 01 TGGATATGTT CTGCCAAGGG TTGGTTTGCG CATTCACAGT TCTCCGCAAG AATTGATTGG 
6 061 CTCCAATTCT TGGAGTGGTG AATCCGTTAG CGAGGTGCCG CCGGCTTCCA TTCAGGTCGA 
6121 GGTGGCCCGG CTCCATGCAC CGCGACGCAA CGCGGGGAGG CAGACAAGGT ATAGGGCGGC 
6181 GCCTACAATC CATGCCAACC CGTTCCATGT GCTCGCCGAG GCGGCATAAA TCGCCGTGAC 
6241 GATCAGCGGT CCAGTGATCG AAGTTAGGCT GGTAAGAGCC GCGAGCGATC CTTGAAGCTG 

63 01 TCCCTGATGG TCGTCATCTA CCTGCCTGGA CAGCATGGCC TGCAACGCGG GCATCCCGAT 
6 361 GCCGCCGGAA G C G AG AAGAA TCATAATGGG GAAGG C CATC CAGCCTCGCG TCGCGAACGC 
6 421 CAGCAAGACG TAGCCCAGCG CGTCGGCCGC CATGCCGGCG ATAATGGCCT GCTTCTCGCC 

64 81 GAAACGTTTG GTGGCGGGAC CAGTGACGAA GGCTTGAGCG AGGGCGTGCA AG ATT C C G AA 
6 541 TACCGCAAGC GACAGGCCGA TCATCGTCGC GCTCCAGCGA AAGCGGTCCT CG CCGAAAAT 
66 01 GACCCAGAGC GCTGCCGGCA CCTGTCCTAC GAGTTGCATG ATAAAGAAGA CAGTCATAAG 
6 661 TGCGGCGACG ATAGTCATGC CCCGCGCCCA CCGGAAGGAG CTGACTGGGT TGAAGGCTCT 
6 721 CAAGGGCATC GGTCGATCGA CGCTCTCCCT TATGCGACTC CTG CATTAGG AAGCAGCCCA 
6 781 GTAGTAGGTT GAGGCCGTTG AGCACCGCCG CCGCAAGGAA TGGTGCATGC AAGGAGATGG 
6 841 CGCCCAACAG TCCCCCGGCC ACGGGGCCTG CCACCATACC CACGCCGAAA CAAGCGCTCA 
6 901 TGAGCCCGAA GTGGCGAGCC CGATCTTCCC CAT CGGTGAT GTCGGCGATA TAGGCGCCAG 
6961 CAACCGCACC TGTGGCGCCG GTGATGCCGG CCACGATGCG TCCGGCGTAG AGG 
pDEST16 6675 bp 



Location (Base Nos.)    Gene Encoded 
104..457                trxA 
585..461                attR1 
694..1353               CmR 
1473..1557              inactivated ccdA 
1695..2000              ccdB 
2041..2165              attR2 



1 AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG AGACCACAAC GGTTTCCCTC 
61 TAGAAATAAT TTTGTTTAAC TTTAAGAAGG AGATATACAT ATGAGCGATA AAATTATTCA 
121 CCTGACTGAC GACAGTTTTG ACACGGATGT ACTCAAAGCG GACGGGGCGA TCCTCGTCGA 
181 TTTCTGGGCA GAGTGGTGCG GTCCGTGCAA AATGATCGCC CCGATTCTGG ATGAAATCGC 
241 TGACGAATAT CAGGGCAAAC TGACCGTTGC AAAACTGAAC ATCGATCAAA ACCCTGGCAC 
301 TGCGCCGAAA TATGGCATCC GTGGTATCCC GACTCTGCTG CTGTTCAAAA ACGGTGAAGT 
361 GGCGGCAACC AAAGTGGGTG CACTGTCTAA AGGTCAGTTG AAAGAGTTCC TCGACGCTAA 
421 CCTGGCCGGT TCTGGTTCTG GTGATGACGA TGACAAGATC ACAAGTTTGT ACAAAAAAGC 
481 TGAACGAGAA ACGTAAAATG ATATAAATAT CAATATATTA AATTAGATTT TGCATAAAAA 
541 ACAGACTACA TAATACTGTA AAACACAACA TATCCAGTCA CTATGGCGGC CGCATTAGGC 
601 ACCCCAGGCT TTACACTTTA TGCTTCCGGC TCGTATAATG TGTGGATTTT GAGTTAGGAT 
661 CCGGCGAGAT TTTCAGGAGC TAAGGAAGCT AAAATGGAGA AAAAAATCAC TGGATATACC 
721 ACCGTTGATA TATCCCAATG GCATCGTAAA GAACATTTTG AGGCATTTCA GTCAGTTGCT 
781 CAATGTACCT ATAACCAGAC CGTTCAGCTG GATATTACGG CCTTTTTAAA GACCGTAAAG 
841 AAAAATAAGC ACAAGTTTTA TCCGGCCTTT ATTCACATTC TTGCCCGCCT GATGAATGCT 
901 CATCCGGAAT TCCGTATGGC AATGAAAGAC GGTGAGCTGG TGATATGGGA TAGTGTTCAC 
961 C CTTGTTAC A CCGTTTTCCA TGAGCAAACT GAAACGTTTT CATCGCTCTG GAGTGAATAC 
1021 CACGACGATT TCCGGCAGTT TCTACACATA TATTCGCAAG ATGTGGCGTG TTACGGTGAA 
1081 AACCTGGCCT ATTTCCCTAA AGGGTTTATT GAGAATATGT TTTTCGTCTC AGCCAATCCC 
1141 TGGGTGAGTT TCACCAGTTT TGATTTAAAC GTGGC C AATA TGGACAACTT CTTCGCCCCC 
12 01 GTTTTCACCA TGGGCAAATA TTATACGCAA GGCGACAAGG TGCTGATGCC GCTGGCGATT 

12 61 CAGGTTCATC ATG CCGTCTG TGATGGCTTC CATGTCGGCA GAATGCTTAA TGAATTACAA 
1321 CAGTACTGCG ATGAGTGGCA GGGCGGGGCG TAAACGCGTG GATCCGGCTT ACTAAAAGCC 

13 81 AGATAACAGT ATGCGTATTT GCGCGCTGAT TTTTGCGGTA TAAGAATATA TACTGATATG 

14 41 TATACCCGAA GTATGTCAAA AAGAGGTGTG CTATGAAGCA GCGTATTACA GTGACAGTTG 

15 01 ACAGCGACAG CTATCAGTTG CTCAAGGCAT ATATGATGTC AATATCTCCG GTCTGGTAAG 
1561 CACAACCATG CAGAATGAAG CCCGTCGTCT GCGTGCCGAA CGCTGGAAAG CGGAAAATCA 
1621 GGAAGGGATG GCTGAGGTCG CCCGGTTTAT TGAAATGAAC GGCTCTTTTG CTGACGAGAA 
1681 CAGGGACTGG TGAAATGCAG TTTAAGGTTT ACACCTATAA AAGAGAGAGC CGTTATCGTC 
1741 TGTTTGTGGA TGTACAGAGT GATATTATTG ACACGCCCGG GCGACGGATG GTGATCCCCC 
18 01 TGGCCAGTGC ACGTCTGCTG TCAGATAAAG TCTCCCGTGA ACTTTACCCG GTGGTGCATA 
18 61 TCGGGGATGA AAGCTGGCGC ATGATGACCA CCGATATGGC CAGTGTGCCG GTCTCCGTTA 
1921 TCGGGGAAGA AGTGGCTGAT CTCAGCCACC GCGAAAATGA CATCAAAAAC GCCATTAACC 
1981 TGATGTTCTG GGGAATATAA ATGTCAGGCT C C CTTATAC A CAGCCAGTCT GCAGGTCGAC 
2 041 CATAGTGACT GGATATGTTG TGTTTTACAG TATTATGTAG TCTGTTTTTT ATGCAAAATC 
2101 TAATTTAATA TATTGATATT TATATCATTT TACGTTT CTC GTTCAGCTTT CTTGTACAAA 
2161 GTGGTGATGA TCCGGCTGCT AACAAAGCCC GAAAGGAAGC TGAGTTGGCT GCTGCCACCG 
2221 CTGAG C AATA ACTAG CATAA CCCCTTGGGG CCTCTAAACG GGTCTTGAGG GGTTTTTTGC 
2281 TGAAAGGAGG AACTATATCC GGATATCCAC AGGACGGGTG TGGTCGCCAT GATCGCGTAG 
2 341 TCGATAGTGG CTCCAAGTAG CGAAGCGAGC AGGACTGGGC GGCGGCCAAA GCGGTCGGAC 
2401 AGTGCTCCGA GAACGGGTGC GCATAGAAAT TGCATCAACG CATATAGCGC TAGCAGCACG 
2461 CCATAGTGAC TGGCGATGCT GTCGGAATGG ACGATATCCC GCAAGAGGCC CGGCAGTACC 
2521 GGCATAACCA AGCCTATGCC TACAGCATCC AGGGTGACGG TGCCGAGGAT GACGATGAGC 
2581 GCATTGTTAG ATTTCATACA CGGTGCCTGA CTGCGTTAGC AATTTAACTG TGATAAACTA 
2641 CCGCATTAAA GCTTATCGAT GATAAGCTGT CAAACATGAG AATTCTTGAA GACGAAAGGG 
2701 CCTCGTGATA CGCCTATTTT TATAGGTTAA TGTCATGATA ATAATGGTTT CTTAGACGTC 
2761 AGGTGGCACT TTTCGGGGAA ATGTGCGCGG AACCCCTATT TGTTTATTTT TCTAAATACA 
2821 TTCAAATATG TATCCGCTCA TGAGACAATA ACCCTGATAA ATGCTTCAAT AATATTGAAA 
28 81 AAGGAAGAGT ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 

2 941 TTGCCTTCCT GTTTTTG CTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG CTGAAGATCA 

3 0 01 GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC AG CGGTAAGA TCCTTGAGAG 
3 061 TTTTCGCCCC GAAGAACGTT TTCCAATGAT GAGCACTTTT AAAGTTCTGC TATGTGGCGC 
3121 GGTATTATCC CGTGTTGACG CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA 
3181 GAATGACTTG GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 
3241 AAGAGAATTA TGCAGTGCTG C C AT AAC CAT GAGTGATAAC ACTGCGGCCA ACTTACTTCT 
3 3 01 GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG CACAACATGG GGGATCATGT 
3361 AACTCGCCTT GATCGTTGGG AACCGGAGCT GAATGAAGCC ATACCAAACG ACG AG CGTGA 
3 421 CACCACGATG CCTGCAGCAA TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT 
34 81 TACTCTAGCT TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 
3 541 ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG GAGCCGGTGA 
3 6 01 GCGTGGGTCT CGCGGTATCA TTG C AG C ACT GGGGCCAGAT GGTAAGCCCT CCCGTATCGT 
3661 AGTTATCTAC ACGACGGGGA GT C AG G C AAC TATGGATGAA CGAAATAGAC AGATCGCTGA 
3 721 GATAGGTGCC TCACTGATTA AG CATTGGTA ACTGTCAGAC CAAGTTTACT CATATATACT 
3 781 TTAGATTGAT TTAAAACTTC ATTTTTAATT TAAAAGGATC TAGGTGAAGA TCCTTTTTGA 
3 841 TAATCTCATG ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT CAGACCCCGT 
3 901 AGAAAAGATC AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA 

3 961 AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC TACCAACTCT 

4 021 TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTCC TTCTAGTGTA 
4 081 GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT 
4141 AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG GGTTGGACTC 
42 01 AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA 

42 61 GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG AGCTATGAGA 
4 321 AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG 

43 81 AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT 
4441 CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG 

45 01 CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT 
4 561 TGCTCACATG TTCTTTCCTG CGTTATCCCC TGATTCTGTG GATAAC CGTA TTACCGCCTT 

46 21 TGAGTGAGCT GATACCGCTC GCCGCAGCCG AACGACCGAG CGCAGCGAGT C AG TG AG C G A 
46 81 GGAAG CGGAA GAGCGCCTGA TGCGGTATTT TCTCCTTACG CATCTGTGCG GTATTTCACA 
4 741 CCGCATATAT GGTGCACTCT CAGTACAATC TGCTCTGATG CCGCATAGTT AAGCCAGTAT 
4 8 01 ACACTCCGCT ATCGCTACGT GACTGGGTCA TGGCTGCGCC CCGACACCCG CCAACACCCG 
48 61 CTGACGCGCC CTGACGGGCT TGTCTGCTCC CGGCATCCGC TTACAGACAA GCTGTGACCG 
4 921 TCTCCGGGAG CTGCATGTGT CAGAGGTTTT CACCGTCATC ACCGAAACGC GCGAGGCAGC 

4 981 TGCGGTAAAG CTCATCAGCG TGGTCGTGAA GCGATTCACA GATGTCTGCC TGTTCATCCG 

5 041 CGTCCAGCTC GTTGAGTTTC TCCAGAAGCG TTAATGTCTG GCTTCTGATA AAGCGGGCCA 
5101 TGTTAAGGGC GGTTTTTTCC TGTTTGGTCA CTGATGCCTC CGTGTAAGGG GGATTTCTGT 
5161 TCATGGGGGT AATGATACCG ATGAAACGAG AGAGGATGCT CACGATACGG GTTACTGATG 
5221 ATGAACATGC CCGGTTACTG GAACGTTGTG AGGGTAAACA ACTGGCGGTA TGGATGCGGC 
52 81 GGGAC CAGAG AAAAATCACT CAGGGTCAAT GCCAGCGCTT C GTTAATAC A GATGTAGGTG 
5341 TTCCACAGGG TAG C C AG C AG CATCCTGCGA TGCAGATCCG GAACATAATG GTGCAGGGCG 
54 01 CTGACTTCCG CGTTTCCAGA CTTTACGAAA CACGGAAACC GAAGACCATT CATGTTGTTG 

54 61 CTCAGGTCGC AGACGTTTTG CAGCAGCAGT CGCTTCACGT TCGCTCGCGT ATCGGTGATT 
5521 CATTCTGCTA ACCAGTAAGG CAACCCCGCC AGCCTAGCCG GGTCCTCAAC GACAGGAGCA 
5581 CGATCATGCG CACCCGTGGC CAGGACCCAA CGCTGCCCGA GATGCGCCGC GTGCGGCTGC 
5641 TGGAGATGGC GGACGCGATG GATATGTTCT GCCAAGGGTT GGTTTGCGCA TTCACAGTTC 
5701 TCCGCAAGAA TTGATTGGCT CCAATTCTTG GAGTGGTGAA TCCGTTAGCG AGGTGCCGCC 
5 761 GGCTTCCATT CAGGTCGAGG TGGCCCGGCT CCATGCACCG CGACGCAACG CGGGGAGGCA 
58 21 GACAAGGTAT AGGGCGGCGC CTACAATCCA TGCCAACCCG TTCCATGTGC TCGCCGAGGC 
58 81 GGCATAAATC GCCGTGACGA TCAGCGGTCC AGTGATCGAA GTTAGGCTGG TAAGAGCCGC 

5 941 GAGCGATCCT TGAAGCTGTC CCTGATGGTC GTCATCTACC TGCCTGGACA GCATGGCCTG 

6 0 01 CAACGCGGGC ATCCCGATGC CGCCGGAAGC GAGAAGAATC ATAATGGGGA AGGCCATCCA 
6 061 GCCTCGCGTC GCGAACGCCA GCAAGACGTA GCCCAGCGCG TCGGCCGCCA TGCCGGCGAT 
6121 AATGGCCTGC TTCTCGCCGA AACGTTTGGT GGCGGGACCA GTGACGAAGG CTTGAGCGAG 
6181 GGCGTGCAAG ATTCCGAATA CCGCAAGCGA CAGGCCGATC ATCGTCGCGC TCCAGCGAAA 
6241 GCGGTCCTCG CCGAAAATGA CCCAGAGCGC TGCCGGCACC TGTCCTACGA GTTGCATGAT 
6301 AAAGAAGACA GTCATAAGTG CGGCGACGAT AGTCATGCCC CGCGCCCACC GGAAGGAGCT 
6361 GACTGGGTTG AAGGCTCTCA AGGGCATCGG TCGATCGACG CTCTCCCTTA TGCGACTCCT 
6421 GCATTAGGAA GCAGCCCAGT AGTAGGTTGA GGC CGTTGAG CACCGCCGCC GCAAGGAATG 

64 81 GTGCATGCAA GGAGATGGCG CCCAACAGTC CCCCGGCCAC GGGGCCTGCC ACCATACCCA 
6 541 CGCCGAAACA AGCGCTCATG AGCCCGAAGT GGCGAGCCCG ATCTTCCCCA TCGGTGATGT 
66 01 C GG C GAT AT A GGCGCCAGCA ACCGCACCTG TGGCGCCGGT GATGCCGGCC ACGATGCGTC 
6661 CGGCGTAGAG GATCG 
pDEST17 6354 bp 

Location (Base Nos.)    Gene Encoded 
258..134                attR1 
367..1026               CmR 
1146..1230              inactivated ccdA 
1368..1673              ccdB 
1714..1838              attR2 
2564..3421              ampR 

1 CGATCCCGCG AAATTAATAC GACTCACTAT AGGGAGACCA CAACGGTTTC CCTCTAGAAA 
61 TAATTTTGTT TAACTTTAAG AAGGAGATAT ACATATGTCG TACTACCATC ACCATCACCA 
121 TCACCTCGAA TCAACAAGTT TGTACAAAAA AGCTGAACGA GAAACGTAAA ATGATATAAA 
181 TATCAATATA TTAAATTAGA TTTTGCATAA AAAACAGACT ACATAATACT GTAAAACACA 
241 ACATATCCAG TCACTATGGC GGCCGCATTA GGCACCCCAG GCTTTACACT TTATGCTTCC 
301 GGCTCGTATA ATGTGTGGAT TTTGAGTTAG GATCCGTCGA GATTTTCAGG AGCTAAGGAA 
361 GCTAAAATGG AGAAAAAAAT CACTGGATAT ACCACCGTTG ATATATCCCA ATGGCATCGT 
421 AAAGAACATT TTGAGGCATT TCAGTCAGTT GCTCAATGTA CCTATAACCA GAC CGTTCAG 
481 CTGGATATTA CGGCCTTTTT AAAGAC CGTA AAGAAAAATA AG C AC AAGTT TTATCCGGCC 
541 TTTATTCACA TTCTTGCCCG CCTGATGAAT GCTCATCCGG AATTCCGTAT GGCAATGAAA 
601 GACGGTGAGC TGGTGATATG GGATAGTGTT CACCCTTGTT ACACCGTTTT CCATGAGCAA 
661 ACTGAAACGT TTTCATCGCT CTGGAGTGAA TACCACGACG ATTTCCGGCA GTTTCTACAC 
721 ATATATTCGC AAGATGTGGC GTGTTACGGT GAAAACCTGG CCTATTTCCC TAAAGGGTTT 
781 ATTGAGAATA TGTTTTTCGT CTCAGCCAAT CCCTGGGTGA GTTTCACCAG TTTTGATTTA 
841 AACGTGGCCA ATATGGACAA CTTCTTCGCC CCCGTTTTCA CCATGGGCAA ATATTATACG 
9 01 CAAGG CGACA AGGTG CTGAT GCCGCTGGCG ATTCAGGTTC ATCATGCCGT CTGTGATGGC 
961 TTCCATGTCG GCAGAATGCT TAATGAATTA CAACAGTACT GCGATGAGTG GCAGGGCGGG 
1021 G CGTAAAG AT CTGGATCCGG CTTACTAAAA GCCAGATAAC AGTATGCGTA TTTGCGCGCT 
1081 GATTTTTGCG GTATAAGAAT ATATACTGAT ATGTAT AC C C GAAGTATGTC AAAAAGAGGT 
1141 GTGCTATGAA G C AG C GTATT ACAGTGACAG TTGACAGCGA CAGCTATCAG TTGCTCAAGG 
12 01 CATATATGAT GTCAATATCT CCGGTCTGGT AAGCACAACC ATGCAGAATG AAGCCCGTCG 

12 61 TCTGCGTGCC GAACGCTGGA AAG CGGAAAA TCAGGAAGGG ATGGCTGAGG TCGCCCGGTT 
1321 TATTGAAATG AACGGCTCTT TTGCTGACGA GAACAGGGAC TGGTGAAATG CAGTTTAAGG 

13 81 TTTAC AC CTA TAAAAGAGAG AGCCGTTATC GTCTGTTTGT GGATGTACAG AG TG AT ATTA 

14 41 TTGACACGCC CGGGCGACGG ATGGTGATCC CCCTGGCCAG TGCACGTCTG CTGT C AG ATA 

15 01 AAGTCTCCCG TGAACTTTAC CCGGTGGTGC ATATCGGGGA TGAAAGCTGG CGCATGATGA 
1561 C C AC C GAT AT GGCCAGTGTG CCGGTCTCCG TTATCGGGGA AGAAGTGGCT GATCTCAGCC 

16 21 ACCGCGAAAA TGACATCAAA AACGCCATTA ACCTGATGTT CTGGGGAATA TAAATGTCAG 
16 81 GCTCCCTTAT AC AC AG C C AG TCTGCAGGTC GACCATAGTG ACTGGATATG TTGTGTTTTA 
1741 CAGTATTATG TAGTCTGTTT TTTATG C AAA ATCTAATTTA ATATATTGAT ATTTATATCA 
18 01 TTTTACGTTT CTCGTTCAGC TTTCTTGTAC AAAGTGGTTG ATTCGAGGCT GCTAACAAAG 
18 61 CCCGAAAGGA AGCTGAGTTG GCTGCTGCCA CCGCTGAGCA ATAACTAG C A TAACCCCTTG 

1921 GGGCCTCTAA ACGGGTCTTG AGGGGTTTTT TGCTGAAAGG AGGAACTATA TCCGGATATC 

1981 CACAGGACGG GTGTGGTCGC CATGATCGCG TAGTCGATAG TGGCTCCAAG TAGCGAAGCG 

2 041 AGCAGGACTG GGCGGCGGCC AAAGCGGTCG GACAGTGCTC CGAGAACGGG TG C G CAT AG A 

2101 AATTGCATCA ACGCATATAG CGCTAGCAGC ACGC CATAGT GACTGGCGAT GCTGTCGGAA 

2161 TGGACGATAT CCCGCAAGAG GCCCGGCAGT ACCGGCATAA CCAAGCCTAT GCCTACAGCA 

2221 TCCAGGGTGA CGGTGCCGAG GATGACGATG AGCGCATTGT TAGATTTCAT ACACGGTGCC 

22 81 TGACTG CGTT AG C AATTTAA CTGTGATAAA CTACCGCATT AAAGCTTATC GATGATAAGC 

2 341 TGTCAAACAT GAGAATTCTT GAAGACGAAA GGGCCTCGTG ATACGCCTAT TTTTATAGGT 

24 01 TAATGT CATG ATAATAATGG TTTCTTAGAC GTCAGGTGGC ACTTTTCGGG GAAATGTGCG 

24 61 CGGAACCCCT ATTTGTTTAT TTTTCTAAAT AC ATT C AAAT ATGTATCCGC TCATGAGACA 

2 521 ATAACCCTGA TAAATGCTTC AATAATATTG AAAAAGGAAG AGTATGAGTA TTCAACATTT 

2581 CCGTGTCGCC CTTATTCCCT TTTTTG CGG C ATTTTGCCTT CCTGTTTTTG CTCACCCAGA 

2641 AACGCTGGTG AAAGTAAAAG ATGCTGAAGA TCAGTTGGGT GCACGAGTGG GTTACATCGA 
2701 ACTGGATCTC AACAGCGGTA AGATCCTTGA GAGTTTTCGC CCCGAAGAAC GTTTTCCAAT 

27 61 GATGAGCACT TTTAAAGTTC TGCTATGTGG CGCGGTATTA TCCCGTGTTG ACGCCGGGCA 

28 21 AGAGCAACTC GGTCGCCGCA TACACTATTC TCAGAATGAC TTGGTTGAGT ACTCACCAGT 
28 81 CACAGAAAAG CATCTTACGG ATGGCATGAC AGTAAGAGAA TTATGCAGTG CTGCCATAAC 

2 941 CATGAGTGAT AACACTG CGG CCAACTTACT TCTGACAACG ATCGGAGGAC CGAAGGAGCT 

3 001 AACCGCTTTT TTGCACAACA TGGGGGATCA TGTAACTCGC CTTGATCGTT GGGAACCGGA 
3 061 GCTGAATGAA G C CAT AC C AA ACGACGAGCG TGACACCACG ATGCCTGCAG CAATGGCAAC 
3121 AACGTTGCGC AAACTATTAA CTGGCGAACT ACTTACTCTA GCTTCCCGGC AACAATTAAT 
3181 AGACTGGATG GAGG CGGATA AAGTTGCAGG ACCACTTCTG CGCTCGGCCC TTCCGGCTGG 
3 241 CTGGTTTATT GCTGATAAAT CTGGAGCCGG TGAGCGTGGG TCTCGCGGTA TCATTGCAGC 
3 3 01 ACTGGGGCCA GATGGTAAGC CCTCCCGTAT CGTAGTTATC TACACGACGG GGAGTCAGGC 
3 361 AACTATGGAT GAACGAAATA GACAGATCGC TGAGATAGGT GCCTCACTGA TTAAGCATTG 
3421 GTAACTGTCA GACCAAGTTT ACTCATATAT ACTTTAGATT GATTTAAAAC TTCATTTTTA 
34 81 ATTTAAAAGG ATCTAGGTGA AGATCCTTTT TGATAATCTC ATGACCAAAA TCCCTTAACG 
3 541 TGAGTTTTCG TTCCACTGAG CGTCAGACCC CGTAGAAAAG ATCAAAGGAT CTTCTTGAGA 
3 6 01 TCCTTTTTTT CTGCGCGTAA TCTGCTGCTT GCAAACAAAA AAACCACCGC TACCAGCGGT 
3 661 GGTTTGTTTG CCGGATCAAG AGCTACCAAC TCTTTTTCCG AAGGTAACTG GCTTCAGCAG 
3 721 AG CG C AG ATA CCAAATACTG TCCTTCTAGT GTAGCCGTAG TTAGGCCACC ACTTCAAGAA 
3 781 CTCTGTAGCA CCGCCTACAT ACCTCGCTCT GCTAATCCTG TTACCAGTGG CTGCTGCCAG 
3 841 TGGCGATAAG TCGTGTCTTA CCGGGTTGGA CTCAAGACGA TAGTTACCGG ATAAGGCGCA 
3 901 GCGGTCGGGC TGAACGGGGG GTTCGTGCAC ACAGCCCAGC TTGGAGCGAA CGACCTACAC 

3 961 CGAACTGAGA TACCTACAGC GTGAGCTATG AGAAAGCGCC ACGCTTCCCG AAGGGAGAAA 

4 021 GGCGGACAGG TATCCGGTAA GCGGCAGGGT CGGAACAGGA GAGCGCACGA GGGAGCTTCC 
4 081 AGGGGGAAAC GCCTGGTATC TTTATAGTCC TGTCGGGTTT CGCCACCTCT GACTTGAGCG 
4141 TCGATTTTTG TGATGCTCGT CAGGGGGGCG GAGCCTATGG AAAAACGCCA GCAACGCGGC 
42 01 CTTTTTACGG TTCCTGGCCT TTTGCTGGCC TTTTGCTCAC ATGTTCTTTC CTGCGTTATC 
4 261 CCCTGATTCT GTGGATAACC GTATTACCGC CTTTGAGTGA GCTGATACCG CTCGCCGCAG 
4321 CCGAACGACC GAGCGCAGCG AGTCAGTGAG CGAGGAAGCG GAAGAGCGCC TGATGCGGTA 
4 381 TTTTCTCCTT ACGCATCTGT GCGGTATTTC ACACCG CATA TATGGTGCAC TCTCAGTACA 
4 441 ATCTGCTCTG ATGCCGCATA GTTAAGCCAG TATACACTCC GCTATCGCTA CGTGACTGGG 
4 501 TCATGGCTGC GCCCCGACAC CCGCCAACAC CCGCTGACGC GCCCTGACGG GCTTGTCTGC 
4 561 TCCCGGCATC CGCTTACAGA CAAGCTGTGA CCGTCTCCGG GAGCTGCATG TGTCAGAGGT 
4 621 TTTCACCGTC ATCACCGAAA CGCGCGAGGC AGCTGCGGTA AAGCTCATCA GCGTGGTCGT 
4 681 GAAGCGATTC ACAGATGTCT GCCTGTTCAT CCGCGTCCAG CTCGTTGAGT TTCTCCAGAA 
4 741 GCGTTAATGT CTGGCTTCTG ATAAAGCGGG CCATGTTAAG GGCGGTTTTT TCCTGTTTGG 
4 8 01 TCACTGATGC CTCCGTGTAA GGGGGATTTC TGTTCATGGG GGTAATGATA CCGATGAAAC 
4 861 GAGAGAGGAT GCTCACGATA CGGGTTACTG ATGATGAACA TGCCCGGTTA CTGGAACGTT 
4 921 GTGAGGGTAA ACAACTGGCG GTATGGATGC GGCGGGACCA GAGAAAAATC ACTCAGGGTC 

4 981 AATGCCAGCG CTTCGTTAAT AC AGATG TAG GTGTTCCACA GGGTAGCCAG CAGCATCCTG 

5 041 CGATG CAGAT C CGGAAC ATA ATGGTGCAGG GCGCTGACTT CCGCGTTTCC AGACTTTACG 
5101 AAACACGGAA ACCGAAGACC ATTCATGTTG TTGCTCAGGT CGCAGACGTT TTGCAGCAGC 
5161 AGTCGCTTCA CGTTCGCTCG CGTATCGGTG ATTCATTCTG CTAACCAGTA AGGCAACCCC 
5221 G C C AG C CT AG CCGGGTCCTC AACGACAGGA GCACGATCAT GCGCACCCGT GGCCAGGACC 
52 81 CAACGCTGCC CGAGATGCGC CGCGTGCGGC TGCTGGAGAT GGCGGACGCG ATGGATATGT 
5341 TCTGCCAAGG GTTGGTTTGC GCATTCACAG TTCTCCGCAA GAATTGATTG GCTCCAATTC 
5401 TTGGAGTGGT GAATCCGTTA GCGAGGTGCC GCCGGCTTCC ATTCAGGTCG AGGTGGCCCG 

54 61 GCTCCATGCA CCGCGACGCA ACGCGGGGAG G C AG AC AAGG TATAGGGCGG CGCCTACAAT 
5521 CCATGCCAAC CCGTTCCATG TGCTCGCCGA GGCGGCATAA ATCGCCGTGA CGATC AG CGG 

5581 TCCAGTGATC GAAGTTAGGC TGGTAAGAGC CGCGAGCGAT CCTTGAAGCT GTCCCTGATG 
5641 GTCGTCATCT ACCTGCCTGG ACAGCATGGC CTGCAACGCG GGCATCCCGA TGCCGCCGGA 
57 01 AG C G AG AAG A ATCATAATGG GGAAGG C CAT CCAGCCTCGC GTCGCGAACG CCAGCAAGAC 
5761 GTAGCCCAGC GCGTCGGCCG CCATGCCGGC GATAATGGCC TGCTTCTCGC CGAAACGTTT 
5821 GGTGGCGGGA C CAGTGACG A AGGCTTGAGC GAGGGCGTGC AAGATTCCGA AT AC C G C AAG 
5881 CGACAGGCCG ATCATCGTCG CGCTCCAGCG AAAGCGGTCC TCGCCGAAAA TGACCCAGAG 

5 941 CGCTGCCGGC ACCTGTCCTA CGAGTTGCAT GATAAAGAAG ACAGTCATAA GTGCGGCGAC 

6 001 GATAGTCATG CCCCGCGCCC ACCGGAAGGA GCTGACTGGG TTGAAGGCTC TCAAGGGCAT 
6 061 CGGTCGATCG ACGCTCTCCC TTATGCGACT C C TG C ATT AG G AAG C AG C C C AGTAGTAGGT 
6121 TGAGGCCGTT GAGCACCGCC GCCGCAAGGA ATGGTGCATG CAAGGAGATG GCGCCCAACA 




6181 GTCCCCCGGC CACGGGGCCT GCCACCATAC CCACGCCGAA ACAAGCGCTC ATGAGCCCGA 
6241 AGTGGCGAGC CCGATCTTCC CCATCGGTGA TGTCGGCGAT ATAGGCGCCA GCAACCGCAC 
63 01 CTGTGGCGCC GGTGATGCCG GCCACGATGC GTCCGGCGTA GAGGATCGAG ATCT 
pDEST18 6613 bp 

Location (Base Nos.)    Gene Encoded 
474..1449               ampR 
1590..2244              ori 
2738..3850              genR 
4251..4127              attR1 
4501..5160              CmR 
5280..5364              inactivated ccdA 
5502..5807              ccdB 
5848..5972              attR2 
6595..25                lacZ 
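
Two entries in the table above list the larger coordinate first: attR1 (4251..4127) and lacZ (6595..25). On a circular plasmid such an entry can be read in two ways, as a feature on the complementary strand or as a feature that runs through the origin (base 1). The table does not say which reading applies, so the Python sketch below simply computes the feature length under both. `candidate_lengths` is a hypothetical helper name, and the only inputs are the coordinates and the 6613 bp length given in the listing.

```python
def candidate_lengths(start, end, plasmid_length):
    """Return possible feature lengths for 1-based inclusive coordinates.

    For start <= end there is a single forward reading. For start > end,
    both readings are reported: a reverse-strand span, and a forward span
    that wraps around the origin of a circular molecule.
    """
    if start <= end:
        return {"forward": end - start + 1}
    return {
        "reverse_strand": start - end + 1,
        "wraps_origin": (plasmid_length - start + 1) + end,
    }


PLASMID_LENGTH = 6613  # length stated in the header line of the listing

for name, start, end in [("attR1", 4251, 4127),
                         ("attR2", 5848, 5972),
                         ("lacZ", 6595, 25)]:
    print(name, candidate_lengths(start, end, PLASMID_LENGTH))
```

Read as a reverse-strand feature, attR1 spans 125 bases, the same length as attR2 (5848..5972). Read through the origin, lacZ spans 44 bases. Which reading the original authors intended for each entry remains an assumption.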

1 GACGCGCCCT GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG TGGTTACGCG CAGCGTGACC 
61 GCTACACTTG CCAGCGCCCT AGCGCCCGCT CCTTTCGCTT TCTTCCCTTC CTTTCTCGCC 
121 ACGTTCGCCG GCTTTCCCCG TCAAGCTCTA AATCGGGGGC TCCCTTTAGG GTTCCGATTT 
181 AGTG CTTTAC GGCACCTCGA CCCCAAAAAA CTTGATTAGG GTGATGGTTC ACGTAGTGGG 
241 CCATCGCCCT GATAGACGGT TTTTCGCCCT TTGACGTTGG AGTCCACGTT CTTTAATAGT 
3 01 GGACTCTTGT TCCAAACTGG AACAACACTC AACCCTATCT CGGTCTATTC TTTTGATTTA 
361 TAAGGGATTT TGCCGATTTC GGCCTATTGG TTAAAAAATG AGCTGATTTA ACAAAAATTT 
421 AACGCGAATT TTAACAAAAT ATTAACGTTT ACAATTTCAG GTGGCACTTT TCGGGGAAAT 
481 GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG 
541 AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA 
6 01 CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTG CTC AC 
6 61 CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC 
721 ATCGAACTGG AT CTCAACAG CGGTAAGAT C CTTGAGAGTT TTCGCCCCGA AG AAC GTTTT 
781 CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC 
841 GGG CAAGAG C AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA 
901 CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC 
961 ATAAC CATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG 
1021 GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA 
1081 CCGGAGCTGA ATGAAG C CAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGTAGCAATG 
1141 GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA 
12 01 TTAAT AG AC T GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG 

12 61 GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT 
1321 GCAG CACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTAT CTACAC GACGGGGAGT 

13 81 CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG 

14 41 CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATAC TTT AGATTGATTT AAAACTTCAT 

15 01 TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT 
1561 TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT 
1621 TGAGATCCTT TTTTTCTGCG CGTAATCTGC TG CTTG C AAA CAAAAAAACC ACCGCTACCA 

16 81 GCGGTGGTTT GTTTGC CGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC 
1741 AGCAGAGCGC AG ATAC C AAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC 
18 01 AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT 
18 61 GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA G AC G AT AGTT ACCGGATAAG 
1921 GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACAC AG C C C AG CTTGGA G C G AAC G AC C 
1981 TACACCGAAC TGAGATACCT ACAGCGTGAG CATTGAGAAA GCGCCACGCT TCCCGAAGGG 
2 041 AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG 
2101 CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT 
2161 GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC 
2 221 GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG 
2281 TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTG AG CTG A TACCGCTCGC 
2 341 CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG 
24 01 CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCAC AC C G C AG AC C AG C CGCGTAACCT 
2461 GGCAAAATCG GTTACGGTTG AGTAATAAAT GGATGCCCTG CGTAAGCGGG TGTGGGCGGA 
2521 CAATAAAGTC TTAAACTGAA CAAAATAGAT CTAAACTATG ACAATAAAGT CTTAAACTAG 
2581 ACAGAATAGT TGTAAACTGA AATCAGTCCA GTTATGCTGT GAAAAAGCAT ACTGGACTTT 
2641 TGTTATGGCT AAAGCAAACT CTTCATTTTC TGAAGTGCAA ATTGCCCGTC GTATTAAAGA 
2701 GGGGCGTGGC CAAGGGCATG GTAAAGACTA TATTCGCGGC GTTGTGACAA TTTACCGAAC 
2761 AACTCCGCGG CCGGGAAGCC GATCTCGGCT TGAACGAATT GTTAGGTGGC GGTACTTGGG 
2821 TCGATATCAA AGTGCATCAC TTCTTCCCGT ATGCCCAACT TTGTATAGAG AGCCACTGCG 

2 881 GGATCGTCAC CGTAATCTGC TTGCACGTAG AT C AC AT AAG CACCAAGCGC GTTGGCCTCA 
2941 TGCTTGAGGA GATTGATGAG CGCGGTGGCA ATGCCCTGCC TCCGGTGCTC GCCGGA3ACT 

3 001 G CG AGAT CAT AGATATAGAT CTCACTACGC GGCTGCTCAA ACCTGGGCAG AACGTAAGCC 
3 061 GCGAGAGCGC CAACAACCGC TTCTTGGTCG AAGGCAGCAA GCGCGATGAA TGTCTTACTA 
3121 CGGAG CAAGT TCCCGAGGTA ATCGGAGTCC GGCTGATGTT GGGAGTAGGT GGCTAC3TCT 
3181 CCGAACTCAC GACCGAAAAG ATCAAGAGCA GCCCGCATGG ATTTGACTTG GTCAGG3CCG 
3 241 AG C CTACATG TGCGAATGAT GCCCATACTT GAG C C AC C T A ACTTTGTTTT AGGGCGACTG 
33 01 CCCTGCTGCG TAACATCGTT GCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCAAACA 

33 61 TCGACCCACG GCGTAACGCG CTTGCTGCTT GGATGCCCGA GGCATAGACT GTACAAAAAA 
3 421 ACAGTCATAA CAAGCCATGA AAACCGCCAC TGCGCCGTTA CCACCGCTGC GTTCGGTCAA 

34 81 GGTTCTGGAC CAGTTGCGTG AGCGCATACG CTACTTGCAT TACAGTTTAC GAACCGAACA 
3 541 GGCTTATGTC AACTGGGTTC GTGCCTTCAT CCGTTTCCAC GGTGTGCGTC ACCCGG3AAC 
3 6 01 CTTGGGCAGC AGCGAAGTCG AGGCATTTCT GTCCTGGCTG GCGAACGAGC GCAAGGTTTC 

36 61 GGTCTCCACG CATCGTCAGG CATTGGCGGC CTTGCTGTTC TTCTACGGCA AGGTGCTGTG 
3 721 CACGGATCTG CCCTGGCTTC AGGAGATCGG AAGACCTCGG CCGTCGCGGC GCTTGCCGGT 

37 81 GGTGCTGACC CCGGATGAAG TGGTTCGCAT CCTCGGTTTT CTGGAAGGCC- AGCATC3TTT 
3 841 GTTCGCCCAG GACTCTAGCT ATAGTTCTAG TGGTTGGCTA CGTATCGAGC AAGAAAATAA 
3 9 01 AACGCCAAAC GCGTTGGAGT CTTGTGTGCT ATTTTTACAA AGATTCAGAA ATAC G CATC A 

3 961 CTTACAACAA GGGGGACTAT GAAATTATGC ATTTTGAGGA TGCCGGGACC TTTAATTCAA 

4 021 CCCAACACAA TATATTATAG TTAAATAAGA ATTATTT AT C AAATCATTTG TATATTAATT 
4 081 AAAATACTAT ACTGTAAATT ACATTTTATT TACAATGAGG ATCATCACAA GTTTGTACAA 
4141 AAAAGCTGAA CGAGAAACGT AAAAT GAT AT AAATATCAAT ATATTAAATT AGATTTTGCA 
42 01 TAAAAAACAG ACTACATAAT ACTGTAAAAC AC AAC AT AT C CAGTCACTAT GGCGGC3GCT 

42 61 AAGTTGGCAG CATCACCCGA CGCACTTTGC G C C G AAT AAA TACCTGTGAC G G AAG AT C AC 
4 321 TTCGCAGAAT AAATAAATCC TGGTGTCCCT GTTGATACCG GGAAGCCCTG GGCCAACTTT 

43 81 TGGCGAAAAT GAGACGTTGA TCGGCACGTA AGAGGTTCCA ACTTTCACCA T AAT G AAATA 
4 441 AGATCACTAC CGGGCGTATT TTTTGAGTTA TCGAGATTTT CAGGAGCTAA GGAAG CTAAA 
4 501 ATGGAGAAAA AAATCACTGG ATATAC C AC C GTTGATATAT CCCAATGGCA TCGTAAAGAA 
4 561 CATTTTGAGG CATTTCAGTC AGTTGCTCAA TGTACCTATA ACCAGACCGT TCAGCT3GAT 
4 621 ATTACGGCCT TTTTAAAGAC CGTAAAGAAA AAT AAG C AC A AGTTTTATCC GGCCTTTATT 
4681 CACATTCTTG CCCGCCTGAT GAATGCTCAT CCGGAATTCC GTATGGCAAT GAAAGACGGT 
4 741 GAGCTGGTGA TATGGGATAG TGTTCACCCT TGTTACACCG TTTTC CATG A GCAAACTGAA 
4801 ACGTTTTCAT CGCTCTGGAG TGAATACCAC GACGATTTCC GGCAGTTTCT ACACATATAT 
48 61 TCGCAAGATG TGGCGTGTTA CGGTGAAAAC CTGGCCTATT TCCCTAAAGG GTTTATTGAG 
4 921 AATATGTTTT TCGTCTCAGC CAATCCCTGG GTGAGTTTCA CCAGTTTTGA TTTAAACGTG 

4 981 GCCAATATGG AC AAC TT C TT CGCCCCCGTT TTCACCATGG G C AAATATT A TACGCAAGGC 

5 041 GACAAGGTGC TGATGCCGCT GGCGATTCAG GTTCATCATG CCGTCTGTGA TGGCTTCCAT 
5101 GTCGGCAGAA TGCTTAATGA ATTACAACAG TACTGCGATG AGTGGCAGGG CGGGGC3TAA 
5161 ACGCGTGGAT CCGGCTTACT AAAAG C C AG A TAACAGTATG CGTATTTGCG CGCTGATTTT 
52 21 TGCGGTATAA GAATATATAC TGATATGTAT ACCCGAAGTA TGTCAAAAAG AGGTGT3CTA 

52 81 TG AAG C AG C G TATTACAGTG ACAGTTGACA GCGACAGCTA TCAGTTGCTC AAGGCATATA 

53 41 TGATGTCAAT ATCTCCGGTC TGGTAAGCAC AAC CATGCAG AATGAAGCCC GTCGTCTGCG 

5401 TGCCGAACGC TGGAAAGCGG AAAATCAGGA AGGGATGGCT GAGGTCGCCC GGTTTATTGA 
54 61 AATGAACGGC TCTTTTGCTG AC GAG AAC AG GGACTGGTGA AATGCAGTTT AAGGTTTACA 
5 521 C CT AT AAAAG AG AGAGC CGT TATCGTCTGT TTGTGGATGT ACAGAGTGAT ATTATT 3 AC A 
5581 CGCCCGGGCG ACGGATGGTG ATCCCCCTGG CCAGTGCACG TCTGCTGTCA 3ATAAA3TCT 
5 641 CCCGTGAACT TTACCCGGTG GTGCATATCG GGGATGAAAG CTGGCGCATG ATGACCACCG 

57 01 ATATGGCCAG TGTGCCGGTC TCCGTTATCG GGGAAGAAGT GGCTGATCTC AGCCACTGCG 
5 761 AAAATGACAT CAAAAACGCC ATT AAC C TG A TGTTCTGGGG AAT AT AAAT G TCAGGCTCCC 
5821 TT ATAC AC AG CCAGTCTGCA GGTCGACCAT AGTGACTGGA TATGTTGTGT TTTACA3TAT 

58 81 TATGTAGTCT GTTTTTTATG CAAAATCTAA TTTAATATAT TGATATTTAT ATCATTTTAC 
5941 GTTTCTCGTT CAGCTTTCTT GTACAAAGTG GTGATAGCTT GTCGAGAAGT ACTAGAGGAT 
6001 CATAATCAGC CATACCACAT TTGTAGAGGT TTTACTTGCT TTAAAAAACC TCCCACACCT 

6 061 CCCCCTGAAC CTGAAACATA AAATGAATGC AATTGTTGTT GTTAACTTGT TTATT G C AG C 

6121 TTATAATGGT TACAAATAAA GCAATAGCAT CACAAATTTC ACAAATAAAG CATTTTTTTC 

6181 ACTGCATTCT AGTTGTGGTT TGTCCAAACT CATCAATGTA TCTTATCATG TCTGGATCTG 

62 41 ATCACTGCTT GAGCCTAGGA GATCCGAACC AGATAAGTGA AATCTAGTTC CAAACTATTT 

63 01 TGTCATTTTT AATTTTCGTA TTAGCTTACG ACGCTACACC CAGTTCCCAT CTATTTTGTC 

63 61 ACTCTTCCCT AAATAATCCT TAAAAACTCC ATTTCCACCC CTCCCAGTTC CCAACTATTT 
6421 TGTCCGCCCA CAGCGGGGCA TTTTTCTTCC TGTTATGTTT TTAATCAAAC ATCCTGCCAA 

64 81 CTCCATGTGA CAAACCGTCA TCTTCGGCTA CTTTTTCTCT GTCACAGAAT GAAAATTTTT 
6541 CTGTCATCTC TTCGTTATTA ATGTTTGTAA TTGACTGAAT ATCAACG CTT ATTTGCAGCC 
66 01 TGAATGGCGA ATG 
[Figure: pDEST19, double-stranded sequence of the region upstream of and entering attR1, with base positions 61 to 241 marked.] 

pDEST19 6668 bp (rotated to position 1000) 

Location (Base Nos.)    Gene Encoded 
                        attR2 
                        genR 



1 AGTGGTTCGC ATCCTCGGTT TTCTGGAAGG C GAG CATC GT TTGTTCGCCC AGGACTCTAG 
61 CTATAGTTCT AGTGGTTGGC TACGTATATC AAATACTTGT AGGTGACGCC GTCATCTTTC 
121 CATTGTAACG TAAATGGCAA CTTGTAGATG AACGCGCTGT CAAAAAACCG GCCAGTTTCT 
181 TCCACAAACT CGCGCACGGC TGTCTCGTAA ACTTTTGCGT CGCAACAATC GCGATGACCT 
241 CGTGGTATGG AAATTTTTTC TAAAAAAGTG TCGTTCATGT CGGCGGCGGG CGCGTTCGCG 
3 01 CTCCGGTACG CGCGACGGGC ACACAGCAGG ACAGCCTTGT CCGGCTCGAT TATCATAAAC 
361 AATCCTGCAG GCATGCAAGC TCGGATCATC ACAAGTTTGT ACAAAAAAGC TGAACGAGAA 
421 ACGTAAAATG ATATAAATAT CAATATATTA AATTAGATTT TGCATAAAAA ACAGACTACA 
481 TAATACTGTA AAACACAACA TATC CAGTCA CTATGGCGGC CGCTAAGTTG GCAGCATCAC 
541 CCGACGCACT TTGCGCCGAA TAAATACCTG TGACGGAAGA TCACTTCGCA GAATAAATAA 
6 01 ATCCTGGTGT CCCTGTTGAT ACCGGGAAGC CCTGGGCCAA CTTTTGGCGA AAATGAGACG 
661 TTGATCGGCA CGTAAGAGGT TCCAACTTTC AC C AT AATGA AATAAGATCA CTACCGGGCG 
721 TATTTTTTGA GTTATCGAGA TTTTCAGGAG CTAAGGAAGC T AAAATG GAG AAAAAAATCA 
781 C TGGAT AT AC CACCGTTGAT ATATCCCAAT GGCATCGTAA AGAACATTTT GAGG C ATTTC 
841 AGTCAGTTGC TCAATGTACC TATAACCAGA CCGTTCAGCT GGATATTACG GCCTTTTTAA 
901 AGAC CGTAAA GAAAAATAAG CACAAGTTTT ATCCGGCCTT TATTCACATT CTTGCCCGCC 
961 TGATGAATGC TCATCCGGAA TTCCGTATGG CAATGAAAGA CGGTGAGCTG GTGATATGGG 
1021 ATAGTGTTCA CCCTTGTTAC ACCGTTTTCC ATGAGCAAAC TGAAACGTTT TCATCGCTCT 
1081 GGAGTGAATA CCACGACGAT TTCCGGCAGT TTCTACACAT ATATTCGCAA GATGTGGCGT 
1141 GTTACGGTGA AAACCTGGCC TATTTCCCTA AAGGGTTTAT TGAGAATATG TTTTTCGTCT 
12 01 CAGCCAATCC CTGGGTGAGT TTCACCAGTT TTGATTTAAA CGTGGCCAAT ATGGACAACT 

12 61 TCTTCGCCCC CGTTTTCACC ATGGGCAAAT ATTATACGCA AGGCGACAAG GTGCTGATGC 
1321 CGCTGGCGAT TCAGGTTCAT CATGCCGTCT GTGATGGCTT CCATGTCGGC AGAATGCTTA 

13 81 ATGAATTACA ACAGTACTGC GATGAGTGGC AGGGCGGGGC GTAAACGCGT GGATCCGGCT 
1441 T ACT AAAAG C CAGATAACAG TATGCGTATT TGCGCGCTGA TTTTTGCGGT ATAAGAATAT 

15 01 ATACTGATAT GTATACCCGA AGTATGTCAA AAAGAGGTGT GCTATGAAGC AGCGTATTAC 
1561 AGTGACAGTT GACAG CGACA GCTATCAGTT GCTCAAGGCA TATATGATGT CAATATCTCC 
1621 GGTCTGGTAA G C AC AAC CAT GCAGAATGAA GCCCGTCGTC TGCGTGCCGA ACGCTGGAAA 

16 81 GCGGAAAATC AGGAAGGGAT GGCTGAGGTC GCCCGGTTTA TTGAAATGAA CGGCTCTTTT 
1741 GCTGACGAGA ACAGGGACTG GTGAAATGCA GTTTAAGGTT TACACCTATA AAAGAGAGAG 
18 01 CCGTTATCGT CTGTTTGTGG ATGTACAGAG TGATATTATT GACACGCCCG GGCGACGGAT 
1861 GGTGATCCCC CTGGCCAGTG CACGTCTGCT GTCAGATAAA GTCTCCCGTG AACTTTACCC 

1921 GGTGGTGCAT ATCGGGGATG AAAGCTGGCG CATGATGACC AC C G ATATGG CCAGTGTGCC 
1981 GGTCTCCGTT ATCGGGGAAG AAGTGGCTGA TCTCAGCCAC CGCGAAAATG ACATCAAAAA 
2 041 CGCCATTAAC CTGATGTTCT GGGGAATATA AATGTCAGGC TCCCTTATAC ACAGCCAGTC 
2101 TGCAGGTCGA CCATAGTGAC TGGATATGTT GTGTTTTACA GTATTATGTA GTCTGTTTTT 
2161 TATGCAAAAT CTAATTTAAT ATATTGATAT TTAT AT C ATT TTACGTTTCT CGTTCAGCTT 
2221 TCTTGTACAA AGTGGTGATC GAGAAGTACT AGAGGATCAT AATCAG C CAT AC CAC ATTTG 
22 81 TAGAGGTTTT ACTTGCTTTA AAAAACCTCC CACACCTCCC CCTGAACCTG AAACATAAAA 
2 341 TGAATGCAAT TGTTGTTGTT AACTTGTTTA TTGCAGCTTA TAATGGTTAC AAATAAAG C A 
24 01 ATAGCATCAC AAATTTCACA AATAAAG CAT TTTTTT CAC T GCATTCTAGT TGTGGTTTGT 
2 461 C C AAAC T CAT CAATGTATCT TATCATGTCT GGATCTGATC ACTG CTTG AG CCTAGGAGAT 
2 521 C C G AAC C AG A TAAGTGAAAT CTAGTTCCAA ACTATTTTGT CATTTTTAAT TTTCGTATTA 
2581 GCTTACGACG CTACACCCAG TTCCCATCTA TTTTGTCACT CTTCCCTAAA TAATCCTTAA 
2641 AAACTCCATT TCCACCCCTC CCAGTTCCCA ACTATTTTGT CCGCCCACAG CGGGGCATTT 
27 01 TTCTTCCTGT TATGTTTTTA ATCAAACATC CTGCCAACTC CATGTGACAA ACCGTCATCT 
2761 TCGGCTACTT TTTCTCTGTC ACAGAATGAA AATTTTTCTG TCATCTCTTC GTTATTAATG 
2 821 TTTGTAATTG ACTGAATATC AACGCTTATT TGCAGCCTGA ATGG CGAATG GACGCGCCCT 
2 881 GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG TGGTTACGCG CAGCGTGACC GCTACACTTG 

2 941 CCAGCGCCCT AGCGCCCGCT CCTTTCGCTT TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG 

3 001 GCTTTCCCCG TCAAGCTCTA AATCGGGGGC TCCCTTTAGG GTTCCGATTT AGTGCTTTAC 
3 061 GGCACCTCGA CCCCAAAAAA CTTGATTAGG GTGATGGTTC ACGTAGTGGG CCATCGCCCT 
3121 GATAGACGGT TTTTCGCCCT TTGACGTTGG AGTCCACGTT CTTTAATAGT GGACTCTTGT 
3181 TCCAAACTGG AACAACACTC AACCCTATCT CGGTCTATTC TTTTGATTTA TAAGGGATTT 
3241 TGCCGATTTC GGCCTATTGG TTAAAAAATG AGCTGATTTA ACAAAAATTT AACGCGAATT 
3 3 01 TTAACAAAAT ATTAACGTTT ACAATTTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA 
3361 CCC CTATTTG TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC 
3421 CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG 
34 81 TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTG CTCAC CCAGAAACGC 
3 541 TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG 
36 01 ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAAC GTTTT C C AATGATGA 
36 61 GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC GGGCAAGAGC 
3 721 AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG 
3 781 AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAAC C ATGA 
38 41 GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG 
3 9 01 CTTTTTT G C A CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA 

3 961 ATGAAG C CAT AC C AAAC G AC GAGCGTGACA CCACGATGCC TGTAGCAATG GCAACAACGT 

4 021 TGCG CAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT 
4 081 GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT 
4141 TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG 

42 01 GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA 
4 261 TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC 
4 321 TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA 

43 81 AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT 

44 4 1 TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT 

45 01 TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT 
4 561 GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AG C AG AG C G C 
4 621 AGATAC C AAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG 
4 681 TAGCACCGCC TACATAC CTC GCTCTG CTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG 
4 741 ATAAGTCGTG TCTTACCGGG TTGGACTCAA G AC G ATAGTT AC CGG ATAAG GCGCAGCGGT 
48 01 CGGGCTGAAC GGGGGGTTCG TG C AC AC AG C CCAGCTTGGA GCGAACGACC TACACCGAAC 
4861 TGAGATACCT ACAGCGTGAG CATTGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGG CGG 
4 921 AC AGGT AT C C GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG 

4 981 GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT 

5 041 TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT 
5101 TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG 
5161 ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA 
5221 CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG CGGTATTTTC 
52 81 TCCTTACGCA TCTGTGCGGT ATTT C AC AC C G C AG AC C AG C CGCGTAACCT GGCAAAATCG 
5 341 GTTACGGTTG AGTAATAAAT GGATGCCCTG CGTAAGCGGG TGTGGGCGGA C AATAAAGT C 
54 01 TTAAACTGAA CAAAATAGAT CTAAACTATG AC AATAAAGT CTTAAACTAG ACAGAATAGT 

54 61 TGTAAACTGA AATCAGTCCA GTTATG CTGT G AAAAAG CAT ACTGGACTTT TGTTATGGCT 
5521 AAAG CAAACT CTTCATTTTC TGAAGTGCAA ATTGCCCGTC GTATTAAAGA GGGGCGTGGC 

55 81 CAAGGGCATG GTAAAGACTA TATTCGCGGC GTTGTGACAA TTTACCGAAC AACTCCGCGG 
5641 CCGGGAAGCC GATCTCGGCT TGAACGAATT GTTAGGTGGC GGTACTTGGG TCGATATCAA 
57 01 AGTGCATCAC TTCTTCCCGT ATGCCCAACT TTGTATAGAG AGCCACTGCG GGATCGTCAC 
5761 CGTAATCTGC TTGCACGTAG ATCAC ATAAG CACCAAGCGC GTTGGCCTCA TGCTTGAGGA 
5821 GATTGATGAG CGCGGTGGCA ATGCCCTGCC TCCGGTGCTC GCCGGAGACT G C GAG AT CAT 
5881 AGATATAGAT CTCACTACGC GGCTGCTCAA ACCTGGGCAG AACGTAAGCC GCGAGAGCGC 

5 941 CAACAACCGC TTCTTGGTCG AAGGCAGCAA GCGCGATGAA TGTCTTACTA CGGAGCAAGT 

6 001 TCCCGAGGTA ATCGGAGTCC GGCTGATGTT GGGAGTAGGT GGCTACGTCT CCGAACTCAC 
6061 GACCGAAAAG ATCAAGAGCA GCCCGCATGG ATTTGACTTG GTCAGGGCCG AGCCTACATG 
6121 TGCGAATGAT GCCCATACTT GAGCCACCTA ACTTTGTTTT AGGGCGACTG CCCTGCTGCG 
6181 TAACATCGTT GCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCAAACA TCGACCCACG 
6241 GCGTAACGCG CTTGCTGCTT GGATGCCCGA GGCATAGACT GTACAAAAAA ACAGTCATAA 
6301 CAAGCCATGA AAACCGCCAC TGCGCCGTTA CCACCGCTGC GTTCGGTCAA GGTTCTGGAC 
6361 CAGTTGCGTG AGCGCATACG CTACTTGCAT TACAGTTTAC GAACCGAACA GGCTTATGTC 
6421 AACTGGGTTC GTGCCTTCAT CCGTTTCCAC GGTGTGCGTC ACCCGGCAAC CTTGGGCAGC 
6481 AGCGAAGTCG AGGCATTTCT GTCCTGGCTG GCGAACGAGC GCAAGGTTTC GGTCTCCACG 
6541 CATCGTCAGG CATTGGCGGC CTTGCTGTTC TTCTACGGCA AGGTGCTGTG CACGGATCTG 
6601 CCCTGGCTTC AGGAGATCGG AAGACCTCGG CCGTCGCGGC GCTTGCCGGT GGTGCTGACC 
6661 CCGGATGA 



pDEST20 Glutathione-S-transferase Fusion with Polyhedron Promoter for Baculovirus Expression 

[Figure: double-stranded sequence of the pDEST20 cloning region with translation, base positions 430 to 1297 marked, showing the start of the GST coding sequence and the junction between the end of GST and attR1, followed by a restriction map of the plasmid.] 

Restriction sites shown on the map (enzyme, position): 

Not I 131 
Pvu II 350 
EcoR I 452 
PflM I 678 
Xba I 899 
BssH II 946 
Srf I 1320 
Sma I 1321 
Pst I 1574 
BspM I 1576 
Sal I 1580 
Mun I 1830 
Hpa I 1841 
Avr II 1994 
NgoA IV 2480 
Ban II 2510 
BsaH I 3188 
Pvu I 3357 
Fsp I 3504 
Sst II 5118 
EcoR V 5176 
BseR I 5299 
Tth111 I 5578 
Bbs I 6103 
SnaB I 6232 
EcoN I 6397 
EcoO109 I 6422 
Bsg I 6656 
Nsp V 6787 
Swa I 6815 

pDEST20 7066 bp (rotated to position 5800) 

Location (Base Nos.)    Gene Encoded 
592..1263               GST 
1397..1273              attR1 
1506..2165              CmR 
2285..2369              inactivated ccdA 
2507..2812              ccdB 
2853..2977              attR2 
4214..5064              ampR 
5263..5843              ori 



1 CCACTGCGCC GTTACCACCG CTGCGTTCGG TCAAGGTTCT GGACCAGTTG CGTGAGCGCA 

61 TACGCTACTT GCATTACAGT TTACGAACCG AACAGGCTTA TGTCAACTGG GTTCGTGCCT 

121 TCATCCGTTT CCACGGTGTG CGTCACCCGG CAACCTTGGG CAGCAGCGAA GTCGAGGCAT 

181 TTCTGTCCTG GCTGGCGAAC GAGCGCAAGG TTTCGGTCTC CACGCATCGT CAGGCATTGG 

241 CGGCCTTGCT GTTCTTCTAC GGCAAGGTGC TGTGCACGGA TCTGCCCTGG CTTCAGGAGA 

3 01 TCGGAAGACC TCGGCCGTCG CGGCGCTTGC CGGTGGTGCT GACCCCGGAT GAAGTGGTTC 

361 GCATCCTCGG TTTTCTGGAA GGCGAGCATC GTTTGTTCGC CCAGGACTCT AGCTATAGTT 

421 CTAGTGGTTG GCTACGTATA CTCCGGAATA TTAATAGATC ATGGAGATAA TTAAAATGAT 

481 AACCATCTCG CAAATAAATA AGTATTTTAC TGTTTTCGTA ACAGTTTTGT AATAAAAAAA 

541 CCTATAAATA TTCCGGATTA TTCATACCGT CCCACCATCG GGCGCGGATC CATGGCCCCT 

601 ATACTAGGTT ATTGGAAAAT TAAGGGCCTT GTGCAACCCA CTCGACTTCT TTTGGAATAT 

661 CTTGAAGAAA AATATGAAGA G C ATTTG TAT GAGCGCGATG AAGGTGATAA ATGGCGAAAC 

721 AAAAAGTTTG AATTGGGTTT GGAGTTTCCC AATCTTCCTT ATTATATTGA TGGTGATGTT 

781 AAATTAACAC AGTCTATGGC CATCATACGT TATATAGCTG ACAAGCACAA CATGTTGGGT 

841 GGTTGTCCAA AAGAGCGTGC AGAGATTTCA ATG CTTGAAG GAG CGGTTTT GGATATTAGA 

901 TACGGTGTTT CGAGAATTGC ATATAGTAAA GACTTTGAAA CTCTCAAAGT TGATTTTCTT 

961 AGCAAG CTAC CTGAAATGCT GAAAATGTTC GAAGATCGTT TATGTCATAA AACATATTTA 

1021 AATGGTGATC ATGTAACC C A TCCTGACTTC ATGTTGTATG ACGCTCTTGA TGTTGTTTTA 

1081 TACATGGACC CAATGTGCCT GGATGCGTTC CCAAAATTAG TTTGTTTTAA AAAACGTATT 

1141 GAAGCTATCC CACAAATTGA TAAGTACTTG AAATCCAGCA AGTATATAGC ATGGCCTTTG 

12 01 CAGGGCTGGC AAGCCACGTT TGGTGGTGGC GACCATCCTC CAAAATCGGA TCTGGTTCCG 

12 61 CGTCATAATC AAACAAGTTT GTACAAAAAA GCTGAACGAG AAACGTAAAA TGATATAAAT 
1321 ATCAATATAT TAAATTAGAT TTTGCATAAA AAACAGACTA CATAATACTG TAAAACACAA 

13 81 CATATC CAGT CACTATGGCG GCCGCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG 
1441 GCTCGTATGT TGTGTGGATT TTGAGTTAGG ATCCGGCGAG ATTTTCAGGA GCTAAGGAAG 

15 01 CTAAAATGGA GAAAAAAATC ACTGGATATA CCACCGTTGA TATATCCCAA TGGCATCGTA 
1561 AAGAACATTT TGAGGCATTT CAGTCAGTTG CTCAATGTAC C TATAAC C AG ACCGTTCAGC 
1621 TGGATATTAC GGCCTTTTTA AAGACCGTAA AGAAAAATAA G C AC AAGTTT TATCCGGCCT 

16 81 TTATTCACAT TCTTGCCCGC CTGATGAATG CTCATCCGGA ATTCCGTATG GCAATGAAAG 
1741 ACGGTGAGCT GGTGATATGG GATAGTGTTC AC C CTTGTTA CACCGTTTTC C ATG AG C AAA 
18 01 CTGAAACGTT TTCATCGCTC TGGAGTGAAT ACCACGACGA TTTCCGGCAG TTTCTACACA 
18 61 TATATTCGCA AGATGTGGCG TGTTACGGTG AAAACCTGGC CTATTTCCCT AAAGGGTTTA 

1921 TTGAGAATAT GTTTTTCGTC TCAGCCAATC CCTGGGTGAG TTTCACCAGT TTTGATTTAA 

1981 ACGTGGCCAA TATGGACAAC TTCTTCGCCC CCGTTTTCAC CATGGGCAAA TATTATACGC 

2 041 AAGGCGACAA GGTGCTGATG CCGCTGGCGA TTCAGGTTCA TCATGCCGTC TGTGATGGCT 

2101 TCCATGTCGG CAGAATGCTT AATGAATTAC AACAGTACTG CGATGAGTGG CAGGGCGGGG 
2161 CGTAATCTAG AGGATCCGGC TTACTAAAAG CCAGATAACA GTATGCGTAT TTGCGCGCTG 

2221 ATTTTTGCGG TATAAGAATA TATACTGATA TGTATACCCG AAGTATGTCA AAAAGAGGTG 

22 81 TGCTATGAAG C AG CGTATTA CAGTGACAGT TGAC AG CGAC AGCTATCAGT TGCTCAAGGC 

2 341 ATATATGATG TCAATATCTC CGGTCTGGTA AG C AC AAC C A TGCAGAATGA AGCCCGTCGT 

24 01 CTGCGTGCCG AACGCTGGAA AGCGGAAAAT CAGGAAGGGA TGGCTGAGGT CGCCCGGTTT 

24 61 ATTGAAATGA ACGGCTCTTT T G C TGAC GAG AACAGGGACT GGTGAAATGC AGTTTAAGGT 

2 521 TTACAC CTAT AAAAGAGAGA GCCGTTATCG TCTGTTTGTG G ATG TAC AG A GTGATATTAT 

2 581 TGACACGCCC GGGCGACGGA TGGTGATCCC CCTGGCCAGT GCACGTCTGC TGTCAGATAA 

2641 AGTCTCCCGT GAACTTTACC CGGTGGTGCA TATCGGGGAT GAAAGCTGGC GCATGATGAC 
2701 CACCGATATG GCCAGTGTGC CGGTCTCCGT TATCGGGGAA GAAGTGGCTG ATCTCAGCCA 
27 61 CCGCGAAAAT G AC AT C AAAA ACGCCATTAA CCTGATGTTC TGGGGAATAT AAATGTCAGG 
2 821 CTCCCTTATA CACAGCCAGT CTGCAGGTCG AC CAT AG TG A CTGGATATGT TGTGTTTTAC 
2 881 AGTATTATGT AGTCTGTTTT TTATGCAAAA TCTAATTTAA TATATTGATA TTTATATCAT 

2 941 TTTACGTTTC TCGTTCAGCT TTCTTGTACA AAGTGGTTTG ATAGCTTGTC GAGAAGTACT 

3 001 AGAGGATCAT AAT C AG C CAT AC C AC ATTTG TAGAGGTTTT ACTTGCTTTA AAAAACCTCC 
3 061 CACACCTCCC CCTGAACCTG AAACATAAAA TGAATGCAAT TGTTGTTGTT AACTTGTTTA 
3121 TTGCAGCTTA TAATGGTTAC AAATAAAGCA ATAGCATCAC AAATTTCACA AATAAAGCAT 
3181 TTTTTTCACT GCATTCTAGT TGTGGTTTGT CCAAACTCAT CAATGTATCT TATCATGTCT 

32 41 GGATCTGATC ACTGCTTGAG C CTAGGAGAT CCGAACCAGA TAAGTGAAAT CTAGTTCCAA 

33 01 ACTATTTTGT CATTTTTAAT TTT CGTATTA GCTTACGACG CTACACCCAG TTCCCATCTA 
3 361 TTTTGTCACT CTTCCCTAAA TAATC CTTAA AAACTCCATT TCCACCCCTC CCAGTTCCCA 
3 421 ACTATTTTGT CCGCCCACAG CGGGGCATTT TTCTTCCTGT TATGTTTTTA ATCAAACATC 

34 81 CTGCCAACTC CATGTGACAA ACCGTCATCT TCGGCTACTT TTTCTCTGTC ACAGAATGAA 
3 541 AATTTTTCTG TCATCTCTTC GTTATTAATG TTTGTAATTG ACTGAATATC AACGCTTATT 

36 01 TGCAGCCTGA ATGGCGAATG GACGCGCCCT GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG 
3 661 TGGTTACGCG CAGCGTGACC GCTACACTTG CCAGCGCCCT AGCGCCCGCT CCTTTCGCTT 
3 721 TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG GCTTTCCCCG TCAAGCTCTA AATCGGGGGC 

37 81 TCCCTTTAGG GTTCCGATTT AGTGCTTTAC GGCACCTCGA CCCCAAAAAA CTTGATTAGG 
3841 GTGATGGTTC ACGTAGTGGG CCATCGCCCT GATAGACGGT TTTTCGCCCT TTGACGTTGG 
3 901 AGTCCACGTT CTTTAATAGT GGACTCTTGT TCCAAACTGG AACAACACTC AACCCTATCT 

3 961 CGGTCTATTC TTTTGATTTA TAAGGGATTT TGCCGATTTC GGCCTATTGG TTAAAAAATG 

4 021 AGCTGATTTA ACAAAAATTT AACGCGAATT TTAACAAAAT ATTAACGTTT ACAATTTCAG 
4 081 GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTT ATTTTT C TAAATACATT 
4141 CAAATATGTA TCCGCTCATG AGACAATAAC C CTG ATAAAT GCTTCAATAA TATTGAAAAA 
42 01 GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT 
42 61 GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT 
4 321 TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG C GGTAAG AT C CTTGAGAGTT 
4381 TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG 
4441 TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA 

45 01 ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA 
4 561 GAGAATTATG CAGTGCTGCC ATAAC C ATG A GTGATAACAC TGCGGCCAAC TTACTTCTGA 
4 621 CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA 

46 81 CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATG AAG C CAT AC C AAACGAC GAG C GTG AC A 
4 741 CCACGATGCC TGTAG C AATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA 
48 01 CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC 
48 61 TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC 
4 921 GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG 

4 981 TT AT C T AC AC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG AT C G C TG AG A 

5041 TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT
5101 AGATTGATTT AAAAC TT C AT TTTTAATTTA AAAGGAT CTA GGTGAAGATC CTTTTTGATA 
5161 ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG 
5221 AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA 
52 81 CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA T C AAG AG CTA CCAACTCTTT 
5 341 TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATAC C AAA TACTGTCCTT CTAGTGTAGC 

5401 CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA
54 61 TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTAC CGGG TTGGACTCAA 
5521 GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TG C AC AC AG C 
5 581 CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CATTGAGAAA 

5 641 GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA 

57 01 CAGGAGAGCG CACGAGGGAG CTTC CAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG 
5761 GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC 
5821 TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG 

58 81 CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG 
5941 AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG 

6 0 01 AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC 
6 061 G C AG AC C AG C CGCGTAACCT GGCAAAATCG GTTACGGTTG AGTAATAAAT GGATGCCCTG 
6121 CGTAAGCGGG TGTGGGCGGA CAATAAAGTC TTAAACTGAA CAAAATAGAT CTAAACTATG
6181 ACAATAAAGT CTTAAACTAG ACAGAATAGT TGTAAACTGA AATCAGTCCA GTTATGCTGT
6 241 G AAAAAG CAT ACTGGACTTT TGTTATGGCT AAAG C AAAC T CTTCATTTTC TGAAGTGCAA 

63 01 ATTGCCCGTC GTATTAAAGA GGGGCGTGGC CAAGGGCATG GT AAAG ACT A TATTCGCGGC 
6 361 GTTGTGACAA TTTAC CGAAC AACTCCGCGG CCGGGAAGCC GATCTCGGCT TGAACGAATT 
6421 GTTAGGTGGC GGTACTTGGG T C GAT AT C AA AGTGCATCAC TTCTTCCCGT ATGCCCAACT 

64 81 TTGTATAGAG AGCCACTGCG GGATCGTCAC CGTAATCTGC TTGCACGTAG ATCACATAAG 
6 541 CACCAAGCGC GTTGGCCTCA TGCTTGAGGA GATTGATGAG CGCGGTGGCA ATGCCCTGCC 
66 01 TCCGGTGCTC GCCGGAGACT GCGAGATCAT AGATATAGAT CTCACTACGC GGCTGCTCAA 
6661 ACCTGGGCAG AACGTAAGCC GCGAGAGCGC CAACAACCGC TTCTTGGTCG AAGGCAGCAA 
6 721 GCGCGATGAA TGTCTTACTA CGGAGCAAGT TCCCGAGGTA ATCGGAGTCC GGCTGATGTT 
6 781 GGGAGTAGGT GGCTACGTCT CCGAACTCAC G AC C G AAAAG ATCAAGAGCA GCCCGCATGG 
68 41 ATTTGACTTG GTCAGGGCCG AGC CTACATG TGCGAATGAT GCCCATACTT GAGCCACCTA 
6 9 01 ACTTTGTTTT AGGGCGACTG CCCTGCTGCG TAACATCGTT GCTGCTGCGT AACATCGTTG 

6 961 CTGCTCCATA ACATCAAACA TCGACCCACG GCGTAACGCG CTTGCTGCTT GGATGCCCGA 

7 021 GGCATAGACT GTACAAAAAA ACAGTCATAA CAAGCCATGA AAACCG 
Figure: 2-Hybrid Vector with DNA-Binding Domain
pDEST21 11713 bp (rotated to position 11000)

Location (Base Nos.)    Gene Encoded
857..1322               GAL4DB
1456..1332              attR1
1706..2365              CmR
2485..2569              inactivated ccdA
2707..3012              ccdB
3053..3177              attR2
3716..3735              pT7 (T7 promoter)
3899..4354              f1 (f1 intergenic region)
4414..6642              Leu2
7541..8515              kanR
9668..10958             CYH2
11118..848              pADH (ADH promoter)
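The coordinates in the feature table can be read with a small helper. The following is a minimal sketch, not part of the original listing: the function name `feature_length` and its `reverse` flag are hypothetical, and it assumes that a location whose first number exceeds the second either wraps around the circular map (as 11118..848 appears to) or is written for the opposite strand (as 1456..1332 appears to be for attR1). Which case applies has to be supplied by the reader.

```python
def feature_length(start, end, plasmid_len, reverse=False):
    """Length in bp of a feature on a circular plasmid (1-based, inclusive)."""
    if reverse:
        # Assumption: a location such as 1456..1332 is written for the
        # opposite strand, so the coordinates are swapped before measuring.
        start, end = end, start
    if start <= end:
        return end - start + 1
    # start > end: the feature runs through the last base and wraps to base 1.
    return (plasmid_len - start + 1) + end


PDEST21_LEN = 11713  # size stated in the listing

print(feature_length(857, 1322, PDEST21_LEN))                 # GAL4DB
print(feature_length(1456, 1332, PDEST21_LEN, reverse=True))  # attR1
print(feature_length(11118, 848, PDEST21_LEN))                # pADH, wraps past base 11713
```

Under these assumptions the same helper applies to the pDEST22 and pDEST23 tables by substituting their stated sizes.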



1 TTTATTATGT T AC AATATG G AAGG GAACTT TACACTTCTC CTATGCACAT ATATTAATTA 

61 AAGTCCAATG CTAGTAGAGA AGGGGGGTAA CACCCCTCCG CGCTCTTTTC CGATTTTTTT 

121 CTAAACCGTG GAATATTTCG GATATCCTTT TGTTGTTTCC GGGTGTACAA TATGGACTTC 

181 CTCTTTTCTG GCAACCAAAC CCATACATCG GGATTCCTAT AATACCTTCG TTGGTCTCCC 

241 TAACATGTAG GTGGCGGAGG GGAGATATAC AATAGAACAG ATACCAGACA AGACATAATG 

3 01 GGCTAAACAA GACTACACCA ATTACACTGC CTCATTGATG GTGGTACATA ACGAACTAAT 

361 ACTGTAGCCC TAGACTTGAT AGCCATCATC ATAT CGAAGT TTCACTACCC TTTTTCCATT 

421 TGCCATCTAT TGAAGTAATA ATAGGCG CAT GCAACTTCTT TTCTTTTTTT TTCTTTTCTC 

481 TCTCCCCCGT TGTTGTCTCA CCATATCCGC AATGACAAAA AAAATGATGG AAGACACTAA 

541 AGGAAAAAAT T AAC G AC AAA GACAG CACCA ACAGATGTCG TTGTTCCAGA GCTGATGAGG 

6 01 GGTATCTTCG AAC AC AC G AA ACTTTTTCCT TCCTTCATTC ACGCACACTA CTCTCTAATG 

661 AGCAACGGTA TACGGCCTTC CTTCCAGTTA CTTGAATTTG AAATAAAAAA AGTTTGCCGC 

721 TTTG CTATC A AGTATAAATA GACCTGCAAT TATTAATCTT TTGTTTCCTC GTCATTGTTC 

781 TCGTTCCCTT TCTTCCTTGT TTCTTTTTCT G C AC AATATT TCAAG CTATA CCAAGCATAC 

841 AATCAACTCC AAGCTTGAAG CAAGCCTCCT GAAAGATGAA GCTACTGTCT TCTATCGAAC 

901 AAGCATGCGA TATTTGCCGA CTTAAAAAGC TCAAGTGCTC CAAAGAAAAA CCGAAGTGCG 

961 CCAAGTGTCT GAAGAACAAC TGGGAGTGTC GCTACTCTCC CAAAACCAAA AGGTCTCCGC 

1021 TGACTAGGGC ACATCTGACA GAAGTGGAAT CAAGGCTAGA AAGACTGGAA CAGCTATTTC 

1081 TACTGATTTT TCCTCGAGAA GACCTTGACA TGATTTTGAA AATGGATTCT TTACAGGATA 

1141 TAAAAGCATT GTTAACAGGA TTATTTGTAC AAGATAATGT GAATAAAGAT GCCGTCACAG 

12 01 ATAGATTGGC TTCAGTGGAG ACTGATATGC CTCTAACATT GAGACAGCAT AGAATAAGTG 
1261 C GACAT CATC ATCGGAAGAG AGTAGTAACA AAGGTCAAAG ACAGTTGACT GTATCGTCGA 
1321 GGTCGAATCA AACAAGTTTG TACAAAAAAG CTGAACGAGA AACGTAAAAT GATATAAATA 

13 81 TCAATATATT AAATTAGATT TTGCATAAAA AACAGACTAC ATAATACTGT AAAACACAAC 
1441 ATAT C CAGTC ACTATGGCGG CCGCTAAGTT GGCAGCATCA CCCGACGCAC TTTGCGCCGA 
1501 ATAAATACCT GTGACGGAAG ATCACTTCGC AGAATAAATA AATCCTGGTG TCCCTGTTGA 
1561 TACCGGGAAG CCCTGGGCCA ACTTTTGGCG AAAATGAGAC GTTGATCGGC ACGTAAGAGG 
1621 TTCCAACTTT C AC C ATAATG AAATAAGATC ACTACCGGGC GTATTTTTTG AGTT AT C GAG 
16 81 ATTTTCAGGA GCTAAGGAAG CTAAAATGGA GAAAAAAATC ACTGGATATA C C AC CGTTGA 
1741 TATATCCCAA TGGCATCGTA AAGAACATTT TGAGGCATTT CAGTCAGTTG CTCAATGTAC
18 01 CTATAAC C AG ACCGTTCAGC TGGATATTAC GGCCTTTTTA AAGACCGTAA AGAAAAATAA 
18 61 G C AC AAGTTT TATCCGGCCT TTATTCACAT TCTTGCCCGC CTGATGAATG CTCATCCGGA 
1921 ATTCCGTATG GCAATGAAAG ACGGTGAGCT GGTGATATGG GATAGTGTTC ACCCTTGTTA
1981 CACCGTTTTC C ATG AG C AAA CTGAAACGTT TTCATCGCTC TGGAGTGAAT ACCACGACGA 
2 041 TTTCCGGCAG TTTCTACACA TATATTCG C A AGATGTGGCG TGTTACGGTG AAAACCTGGC 
2101 CTATTTCCCT AAAGGGTTTA TTGAGAATAT GTTTTTCGTC TCAGCCAATC CCTGGGTGAG 
2161 TTTCACCAGT TTTGATTTAA ACGTGGCCAA TATGGACAAC TTCTTCGCCC CCGTTTTCAC 
2221 CATGGGCAAA TATTATACGC AAGGCGACAA GGTGCTGATG CCGCTGGCGA TTCAGGTTCA 
22 81 TCATGCCGTC TGTGATGGCT TCCATGTCGG CAGAATGCTT AATGAATTAC AACAGTACTG 
2 341 CGATGAGTGG CAGGGCGGGG CGTAAT CTAG AGGATCCGGC TTACTAAAAG CCAGATAACA 
2401 GTATGCGTAT TTGCGCGCTG ATTTTTGCGG TATAAGAATA TATACTGATA TGTATACCCG
2461 AAGTATGTCA AAAAGAGGTG TGCTATGAAG CAGCGTATTA CAGTGACAGT TGACAGCGAC
2521 AGCTATCAGT TGCTCAAGGC ATATATGATG TCAATATCTC CGGTCTGGTA AGCACAACCA
2581 TGCAGAATGA AGCCCGTCGT CTGCGTGCCG AACGCTGGAA AGCGGAAAAT CAGGAAGGGA
2641 TGGCTGAGGT CGCCCGGTTT ATTGAAATGA ACGGCTCTTT TGCTGACGAG AACAGGGACT
2701 GGTGAAATGC AGTTTAAGGT TTACACCTAT AAAAGAGAGA GCCGTTATCG TCTGTTTGTG
2761 GATGTACAGA GTGATATTAT TGACACGCCC GGGCGACGGA TGGTGATCCC CCTGGCCAGT
2821 GCACGTCTGC TGTCAGATAA AGTCTCCCGT GAACTTTACC CGGTGGTGCA TATCGGGGAT
2881 GAAAGCTGGC GCATGATGAC CACCGATATG GCCAGTGTGC CGGTCTCCGT TATCGGGGAA
2941 GAAGTGGCTG ATCTCAGCCA CCGCGAAAAT GACATCAAAA ACGCCATTAA CCTGATGTTC
3001 TGGGGAATAT AAATGTCAGG CTCCCTTATA CACAGCCAGT CTGCAGGTCG ACCATAGTGA
3061 CTGGATATGT TGTGTTTTAC AGTATTATGT AGTCTGTTTT TTATGCAAAA TCTAATTTAA
3121 TATATTGATA TTTATATCAT TTTACGTTTC TCGTTCAGCT TTCTTGTACA AAGTGGTTTG
3181 ATGGCCGCTA AGTAAGTAAG ACGTCGAGCT CTAAGTAAGT AACGGCCGCC ACCGCGGTGG
3241 AGCTTTGGAC TTCTTCGCCA GAGGTTTGGT CAAGTCTCCA ATCAAGGTTG TCGGCTTGTC
3301 TACCTTGCCA GAAATTTACG AAAAGATGGA AAAGGGTCAA ATCGTTGGTA GATACGTTGT
3361 TGACACTTCT AAATAAGCGA ATTTCTTATG ATTTATGATT TTTATTATTA AATAAGTTAT
3421 AAAAAAAATA AGTGTATACA AATTTTAAAG TGACTCTTAG GTTTTAAAAC GAAAATTCTT
3481 ATTCTTGAGT AACTCTTTCC TGTAGGTCAG GTTGCTTTCT CAGGTATAGC ATGAGGTCGC
3541 TCTTATTGAC CACACCTCTA CCGGCATGCC GAGCAAATGC CTGCAAATCG CTCCCCATTT
3601 CACCCAATTG TAGATATGCT AACTCCAGCA ATGAGTTGAT GAATCTCGGT GTGTATTTTA
3661 TGTCCTCAGA GGACAATACC TGTTGTAATC GTTCTTCCAC ACGGATCCCA ATTCGCCCTA
3721 TAGTGAGTCG TATTACAATT CACTGGCCGT CGTTTTACAA CGTCGTGACT GGGAAAACCC
3781 TGGCGTTACC CAACTTAATC GCCTTGCAGC ACATCCCCCT TTCGCCAGCT GGCGTAATAG
3841 CGAAGAGGCC CGCACCGATC GCCCTTCCCA ACAGTTGCGC AGCCTGAATG GCGAATGGAC
3901 GCGCCCTGTA GCGGCGCATT AAGCGCGGCG GGTGTGGTGG TTACGCGCAG CGTGACCGCT
3961 ACACTTGCCA GCGCCCTAGC GCCCGCTCCT TTCGCTTTCT TCCCTTCCTT TCTCGCCACG
4021 TTCGCCGGCT TTCCCCGTCA AGCTCTAAAT CGGGGGCTCC CTTTAGGGTT CCGATTTAGT
4081 GCTTTACGGC ACCTCGACCC CAAAAAACTT GATTAGGGTG ATGGTTCACG TAGTGGGCCA
4141 TCGCCCTGAT AGACGGTTTT TCGCCCTTTG ACGTTGGAGT CCACGTTCTT TAATAGTGGA
4201 CTCTTGTTCC AAACTGGAAC AACACTCAAC CCTATCTCGG TCTATTCTTT TGATTTATAA
4261 GGGATTTTGC CGATTTCGGC CTATTGGTTA AAAAATGAGC TGATTTAACA AAAATTTAAC
4321 GCGAATTTTA ACAAAATATT AACGTTTACA ATTTCCTGAT GCGGTATTTT CTCCTTACGC
4381 ATCTGTGCGG TATTTCACAC CGCATATCGA CCGGTCGAGG AGAACTTCTA GTATATCCAC
4441 ATACCTAATA TTATTGCCTT ATTAAAAATG GAATCGGAAC AATTACATCA AAATCCACAT
4501 TCTCTTCAAA ATCAATTGTC CTGTACTTCC TTGTTCATGT GTGTTCAAAA ACGTTATATT
4561 TATAGGATAA TTATACTCTA TTTCTCAACA AGTAATTGGT TGTTTGGCCG AGCGGTCTAA
4621 GGCGCCTGAT TCAAGAAATA TCTTGACCGC AGTTAACTGT GGGAATACTC AGGTATCGTA
4681 AGATGCAAGA GTTCGAATCT CTTAGCAACC ATTATTTTTT TCCTCAACAT AACGAGAACA
4741 CACAGGGGCG CTATCGCACA GAATCAAATT CGATGACTGG AAATTTTTTG TTAATTTCAG
4801 AGGTCGCCTG ACGCATATAC CTTTTTCAAC TGAAAAATTG GGAGAAAAAG GAAAGGTGAG
4861 AGGCCGGAAC CGGCTTTTCA TATAGAATAG AGAAGCGTTC ATGACTAAAT GCTTGCATCA
4921 CAATACTTGA AGTTGACAAT ATTATTTAAG GACCTATTGT TTTTTCCAAT AGGTGGTTAG
4981 CAATCGTCTT ACTTTCTAAC TTTTCTTACC TTTTACATTT CAGCAATATA TATATATATT
5041 TCAAGGATAT ACCATTCTAA TGTCTGCCCC TATGTCTGCC CCTAAGAAGA TCGTCGTTTT
5101 GCCAGGTGAC CACGTTGGTC AAGAAATCAC AGCCGAAGCC ATTAAGGTTC TTAAAGCTAT
5161 TTCTGATGTT CGTTCCAATG TCAAGTTCGA TTTCGAAAAT CATTTAATTG GTGGTGCTGC
5221 TATCGATGCT ACAGGTGTCC CACTTCCAGA TGAGGCGCTG GAAGCCTCCA AGAAGGTTGA
5281 TGCCGTTTTG TTAGGTGCTG TGGGTGGTCC TAAATGGGGT ACCGGTAGTG TTAGACCTGA
5341 ACAAGGTTTA CTAAAAATCC GTAAAGAACT TCAATTGTAC GCCAACTTAA GACCATGTAA
5401 CTTTGCATCC GACTCTCTTT TAGACTTATC TCCAATCAAG CCACAATTTG CTAAAGGTAC
5461 TGACTTCGTT GTTGTCAGAG AATTAGTGGG AGGTATTTAC TTTGGTAAGA GAAAGGAAGA
5521 CGATGGTGAT GGTGTCGCTT GGGATAGTGA ACAATACACC GTTCCAGAAG TGCAAAGAAT
5581 CACAAGAATG GCCGCTTTCA TGGCCCTACA ACATGAGCCA CCATTGCCTA TTTGGTCCTT
5641 GGATAAAGCT AATGTTTTGG CCTCTTCAAG ATTATGGAGA AAAACTGTGG AGGAAACCAT
5701 CAAGAACGAA TTCCCTACAT TGAAGGTTCA ACATCAATTG ATTGATTCTG CCGCCATGAT
5761 CCTAGTTAAG AACCCAACCC ACCTAAATGG TATTATAATC ACCAGCAACA TGTTTGGTGA
5821 TATCATCTCC GATGAAGCCT CCGTTATCCC AGGTTCCTTG GGTTTGTTGC CATCTGCGTC
5881 CTTGGCCTCT TTGCCAGACA AGAACACCGC ATTTGGTTTG TACGAACCAT GCCACGGTTC



5941 TGCTCCAGAT TTGCCAAAGA ATAAGGTTGA CCCTATCGCC ACTATCTTGT CTGCTGCAAT

6 0 01 GATGTTGAAA TTGTCATTGA ACTTGCCTGA AGAAGGTAAG GCCATTGAAG ATGCAGTTAA 
6 061 AAAGGTTTTG GATGCAGGTA TCAGAACTGG TGATTTAGGT GGTTCCAACA GTACCACCGA 
6121 AGTCGGTGAT GCTGTCGCCG AAGAAGTTAA GAAAATCCTT GCTTAAAAAG ATTCTCTTTT 
6181 TTTATGATAT TTGTACATAA AC TTTATAAA TGAAATTCAT AATAGAAACG ACACGAAATT 
6 241 AC AAAATG G A ATATGTT CAT AGGGTAGACG AAACTATATA CGCAATCTAC ATACATTTAT 
63 01 CAAGAAGGAG AAAAAGGAGG ATAGTAAAGG AATACAGGTA AG C AAATTG A TACTAATGGC 

63 61 TCAACGTGAT AAGGAAAAAG AATTGCACTT TAACATTAAT ATTGACAAGG AGGAGGGCAC 
6421 CACACAAAAA GTTAGGTGTA ACAGAAAATC ATGAAACTAC GATTCCTAAT TTGATATTGG 

64 81 AGGATTTTCT CTAAAAAAAA AAAAATACAA CAAATAAAAA ACACTCAATG ACCTGACCAT 
6 541 TTGATGGAGT TTAAGTCAAT ACCTTCTTGA ACCATTTCCC ATAATGGTGA AAGTTCCCTC 
6 6 01 AAGAATTTTA CTCTGTCAGA AACGGCCTTA CGACGTAGTC GATATGGTGC ACTCTCAGTA 
6661 CAATCTGCTC TGATG CCGCA TAGTTAAGCC AGCCCCGACA CCCGCCAACA CCCGCTGACG 
6 721 CGCCCTGACG GGCTTGTCTG CTCCCGGCAT CCGCTTACAG ACAAG CTGTG ACCGTCTCCG 
6781 GGAGCTGCAT GTGTCAGAGG TTTTCACCGT CAT C AC C G AA ACGCGCGAGA CGAAAGGGCC 
6 841 TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGGACGGAT 
6 901 CGCTTGCCTG TAACTTACAC GCGCCTCGTA TCTTTTAATG ATGGAATAAT TTGGGAATTT 

6 961 ACTCTGTGTT TATTTATTTT TATGTTTTGT ATTTGGATTT TAGAAAGTAA ATAAAGAAGG 

7 021 TAGAAGAGTT ACGGAATGAA GAAAAAAAAA TAAACAAAGG TTTAAAAAAT TTCAACAAAA 
7 081 AGCGTACTTT ACATATATAT TTATTAGACA AGAAAAGCAG ATTAAATAGA TATACATTCG 
714 1 ATTAACGATA AGTAAAATGT AAAAT C AC AG GATTTTCGTG TGTGGTCTTC TACACAGACA 
72 01 AGATGAAACA ATTCGGCATT AATACCTGAG AG CAGGAAGA GCAAGATAAA AGGTAGTATT 

72 61 TGTTGGCGAT CCCCCTAGAG TCTTTTACAT CTTCGGAAAA CAAAAACTAT TTTTTCTTTA 
7321 ATTTCTTTTT TTACTTTCTA TTTTTAATTT ATATATTTAT ATTAAAAAAT TTAAATTATA 

73 81 ATTATTTTTA TAGCACGTGA TGAAAAGGAC CCAGGTGGCA CTTTTCGGGG AAATGTGCGC 
7441 GGAACCCCTA TTTGTTTATT TTTCTAAATA CATTCAAATA TGTATCCGCT CATGAGACAA 

75 01 TAACC CTGAT AAATGCTTCA ATAATCTGCA GCTCTGGCCC GTGTCTCAAA ATCTCTGATG 
7561 TTACATTGCA CAAGATAAAA ATATATCATC ATGAACAATA AAACTGTCTG CTTACATAAA 
7621 CAGTAATACA AGGGGTGTTA TGAGC CATAT TCAACGGGAA ACGTCTTGCT GGAGGCCGCG 

76 81 ATTAAATTCC AACATGGATG CTGATTTATA TGGGTATAAA TGGGCTCGCG ATAATGTCGG 
7741 GCAATCAGGT GCGACAATCT TTCGATTGTA TGGGAAGCCC GATGCGCCAG AGTTGTTTCT 
78 01 GAAACATGGC AAAGGTAGCG TTGCCAATGA TGTTACAGAT GAGATGGTCA GACTAAACTG 
78 61 GCTGACGGAA TTTATGCCTC TTCCGACCAT CAAGCATTTT ATCCGTACTC CTGATGATGC 
7 921 ATGGTTACTC ACCACTGCGA TCCGCGGGAA AACAGCATTC CAGGTATTAG AAGAATATCC 

7 981 TGATTCAGGT GAAAATATTG TTGATGCGCT GGCAGTGTTC CTGCGCCGGT TGCATTCGAT 

8 041 TCCTGTTTGT AATTGTCCTT TTAACAG CGA TCGCGTATTT CGTCTCGCTC AGGCGCAATC 
8101 ACGAATGAAT AACGGTTTGG TTGATG CGAG TGATTTTGAT GACGAGCGTA ATGGCTGGCC 
8161 TGTTGAACAA GTCTGGAAAG AAATGCATAC GCTTTTGCCA TTCTCACCGG ATTCAGTCGT 
8 221 CACTCATGGT GATTTCTCAC TTGATAACCT TATTTTTGAC GAGGGGAAAT TAATAGGTTG 
82 81 TATTGATGTT GGACGAGTCG GAATCGCAGA CCGATACCAG GATCTTGCCA TCCTATGGAA 
8 341 CTGCCTCGGT GAGTTTTCTC CTTCATTACA GAAACGGCTT TTTCAAAAAT ATGGTATTGA 
84 01 TAATCCTGAT ATGAATAAAT TGCAGTTTCA TTTGATGCTC GATGAGTTTT TCTAATCAGA 
8 461 ATTGGTTAAT TGGTTGTAAC ACTGGCAGAG CATTACGCTG ACTTGACGGG ACGGCGCATG 
8 521 ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT CAGACCCCGT AG AAAAG AT C 
8 581 AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA 
8641 CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG
8 701 GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTCC TTCTAGTGTA GCCGTAGTTA 
8 761 GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA 
8821 CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG GGTTGGACTC AAGACGATAG
8 881 TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA GCCCAGCTTG 

8 941 GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG AG CATTGAGA AAGCGCCACG 

9 001 CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG 
9 061 CGCACGAGGG AGCTTCCAGG GGGGAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC 
9121 CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCCGAG CCTATGGAAA 
9181 AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG 
92 41 TTCTTTCCTG CGTTATCCCC TGATTCTGTG GATAAC CGTA TTACCGCCTT TGAGTGAGCT 
9 3 01 GATACCGCTC GCCGCAGCCG AACGACCGAG CGCAGCGAGT CAGTGAGCGA GGAAGCGGAA 
9361 GAGCGCCCAA TACGCAAACC GCCTCTCCCC GCGCGTTGGC CGATTCATTA ATGCAGCTGG
9421 CACGACAGGT TTCCCGACTG GAAAGCGGGC AGTGAGCGCA ACGCAATTAA TGTGAGTTAC
94 81 CTCACTCATT AGGCACCCCA GG CTTTAC AC TTTATGCTTC CGGCTCCTAT GTTGTGTGGA 
9 541 ATTGTGAGCG GATAACAATT T C AC AC AG G A AACAGCTATG A C CAT G AT T A CGCCAAGCTC 
96 01 GGAATTAACC CTCACTAAAG G GAAC AAAAG CTGGTACCGA TCCCGAGCTT TGCAAATTAA 
96 61 AGCCTTCGAG CGTCCCAAAA CCTTCTCAAG CAAGGTTTTC AGTATAATGT TACATG CGTA 
9721 CACGCGTCTG TACAGAAAAA AAAGAAAAAT TTGAAATATA AATAACGTTC TTAATACTAA 
9781 CATAACTATA AAAAAATAAA TAGGGACCTA GACTTCAGGT TGTCTAACTC CTTCCTTTTC 
9841 GGTTAGAGCG GATGTGGGGG GAGGGCGTGA ATGTAAG CGT GACATAACTA ATTACATGAT 
9901 ATCGACAAAG GAAAAGGGGC CTGTTTACTC ACAGGCTTTT TTCAAGTAGG TAATTAAGTC 
9 961 GTTTCTGTCT TTTTCCTTCT TCAACCCACC AAAGGCCATC TTGGTACTTT TTTTTTTTTT 
10021 TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT
10081 TTTTTTTTTT TTTTTTTTTT TCATAGAAAT AATACAGAAG TAGATGTTGA ATTAGATTAA
10141 ACTGAAGATA TATAATTTAT TGGAAAATAC ATAG AG CTTT TTGTTGATGC GCTTAAGCGA 
10201 TCAATTCAAC AACACCACCA GCAGCTCTGA TTTTTTCTTC AG C C AACTTG GAGACGAATC 
10261 TAG CTTTGAC GATAACTGGA AC ATTTG GAA TTCTACCCTT ACCCAAGATC TTACCGTAAC 
10321 CGGCTGCCAA AGTGTCAATA ACTGGAGCAG TTT CCTTAGA AGCAGATTTC AAGTATTGGT 
10381 CTCTCTTGTC TTCTGGGATC AATGTC CACA ATTTGTCCAA GTTCAAGACT GGCTTCCAGA 
10441 AATGAGCTTG TTGCTTGTGG AAGTATCTCA TACCAACCTT AC C GAAAT AA CCTGGATGGT 
10501 ATTTATCCAT GTTAATTCTG TGGTGATGTT GACCACCGGC CATACCTCTA CCACCGGGGT 
10561 GCTTTCTGTG CTTAC CGATA CGAC CTTTAC CGG CTGAGAC GTGACCTCTG TGCTTTCTAG 
10621 TCTTAGTGAA TCTGGAAGGC ATTCTTGATT AGTTGGATGA TTGTTCTGGG ATTTAATGCA 
10681 AAAATCACTT AAGAAGGAAA ATCAACGGAG AAAGCAAACG CCATCTTAAA TATACGGGAT 
10741 ACAGATGAAA GGGTTTGAAC CTATCTGGAA AATAGCATTA AACAAGCGAA AAACTGCGAG 
10801 GAAAATTGTT TGCGTCTCTG CGGGCTATTC ACGCGCCAGA GGAAAATAGG AAAAATAACA 
10861 GGGCATTAGA AAAATAATTT TGATTTTGGT AATGTGTGGG TCCTGGTGTA C AG AT GTTAC 
10 921 ATTGGTTACA GTACTCTTGT TTTTGCTGTG TTTTTCGATG AATCTCCAAA ATGGTTGTTA 
10 981 GCACATGGAA GAGTCACCGA TGCTAAGTTA TCTCTATGTA AGCTACGTGG CGTGACTTTT 
11041 GATGAAGCCG CACAAGAGAT ACAGGATTGG CAACTGCAAA TAGAATCTGG GGATCCCCCC 
11101 TCGAGATCCG GGATCGAAGA AATGATGGTA AATGAAATAG GAAATCAAGG AG CATGAAGG 
11161 CAAAAGACAA ATATAAGGGT CGAACGAAAA ATAAAGTGAA AAGTGTTGAT ATGATGTATT 
11221 TGGCTTTGCG GCGCCGAAAA AACGAGTTTA CGCAATTGCA CAATCATGCT GACTCTGTGG 
112 81 CGGACCCGCG CTCTTGCCGG CCCGGCGATA ACGCTGGGCG TGAGGCTGTG CCCGGCGGAG 
11341 TTTTTTGCGC CTGCATTTTC CAAGGTTTAC CCTGCGCTAA GGGGCGAGAT TGGAGAAGCA 
114 01 ATAAGAATGC CGGTTGGGGT TGCGATGATG ACGAC CACGA CAACTGGTGT CATTATTTAA 
114 61 GTTGCCGAAA GAAC CTGAGT GCATTTGCAA CATGAGTATA CTAGAAGAAT GAGCCAAGAC 
11521 TT G CG AG AC G CGAGTTTGCC GGTGGTGCGA AC AAT AG AG C GACCATGACC TTGAAGGTGA 
11581 GACGCGCATA ACCGCTAGAG TACTTTGAAG AGGAAACAGC AATAGGGTTG CTACCAGTAT 
11641 AAATAGACAG GTACATACAA CACTGGAAAT GGTTGTCTGT TTGAGTACGC TTTCAATTCA 
117 01 TTTGGGTGTG CAC 
Figure: 2-Hybrid Vector with Activation Domain
pDEST22 8923 bp

Location (Base Nos.)    Gene Encoded
904..1248               GAL4AD
1388..1264              attR1
1638..2297              CmR
2417..2501              inactivated ccdA
2639..2944              ccdB
2985..3109              attR2
3831..4318              f1 (f1 intergenic region)
4334..5176              TRP1
6110..7194              ampR
8344..866               pADH (yeast ADH promoter)



1 TTCATTTGGG TGTGCACTTT ATTATGTTAC AATATGGAAG GGAACTTTAC ACTTCTCCTA 
61 TGCACATATA TTAATTAAAG TCCAATGCTA GTAGAGAAGG GGGGTAACAC CCCTCCGCGC 
121 TCTTTTCCGA TTTTTTTCTA AACCGTGGAA TATTTCGGAT ATC CTTTTGT TGTTTCCGGG 
181 TGTACAATAT GGACTTCCTC TTTTCTGGCA ACCAAACCCA TACATCGGGA TTC CTATAAT 
241 ACCTTCGTTG GTCTCCCTAA CATGTAGGTG GCGGAGGGGA GATATACAAT AGAACAGATA 
3 01 C C AG AC AAG A C ATAATGGG C TAAACAAGAC TACACCAATT ACACTGCCTC ATTGATGGTG 
361 GTACATAACG AACTAATACT GTAGCCCTAG ACTTGATAGC CATCATCATA TCGAAGTTTC 
421 ACTACCCTTT TTC CATTTG C CATCTATTGA AGTAATAATA GGCGCATGCA ACTTCTTTTC 
481 TTTTTTTTTC TTTTCTCTCT CCCCCGTTGT TGTCTCACCA TATCCGCAAT GACAAAAAAA 
541 ATGATGGAAG ACACTAAAGG AAAAAATTAA CGACAAAGAC AG C AC C AAC A GATGTCGTTG 
6 01 TTCCAGAGCT GATGAGGGGT ATCTTCGAAC ACACGAAACT TTTTCCTTCC TTCATTCACG 
661 CACACTACTC TCTAATGAG C AACGGTATAC GGCCTTCCTT CCAGTTACTT GAATTTGAAA 
721 TAAAAAAAGT TTGCCGCTTT GCTATCAAGT ATAAATAGAC CTG CAATTAT TAAT C TTTTG 
781 TTTCCTCGTC ATTGTTCTCG TTCCCTTTCT TCCTTGTTTC TTTTTCTGCA CAATATTTCA 
841 AG CTATAC CA AGCATACAAT CAACTCCAAG CTTATGCCCA AGAAGAAGCG GAAGGTCTCG 
9 01 AGCGGCGCCA ATTTTAAT C A AAGTGGGAAT ATTGCTGATA GCTCATTGTC CTTCACTTTC 
961 ACTAACAGTA GCAACGGTCC GAACCTCATA ACAACTCAAA CAAATTCTCA AGCGCTTTCA 
1021 CAACCAATTG CCTCCTCTAA CGTTCATGAT AACTTCATGA ATAATGAAAT CACGGCTAGT 
1081 AAAATTGATG ATGGTAATAA TTC AAAAC CA CTGTCACCTG GTTGGACGGA CCAAACTGCG 
1141 TATAACGCGT TTGGAATCAC TACAGGGATG TTTAATAC C A CTACAATGGA TGATGTATAT 
12 01 AACTATCTAT TCGATGATGA AGATACCCCA CCAAACCCAA AAAAAGAGGG TGGGTCGAAT 

12 61 CAAACAAGTT TGTACAAAAA AGCTGAACGA GAAACGTAAA ATGATATAAA TATCAATATA 
1321 TTAAATTAGA TTTTG C ATAA AAAACAGACT ACATAATACT GTAAAACACA ACATATCCAG 

13 81 TCACTATGGC GGCCGCTAAG TTGGCAG CAT CACCCGACGC ACTTTGCGCC GAATAAATAC 

14 41 CTGTGACGGA AGATCACTTC GCAGAATAAA TAAATCCTGG TGTCCCTGTT GATACCGGGA 

15 01 AGCCCTGGGC CAACTTTTGG CGAAAATGAG ACGTTGATCG GCACGTAAGA GGTTC CAACT 
1561 TT C AC C ATAA TGAAATAAGA TCACTACCGG GCGTATTTTT TGAGTTATCG AGATTTTCAG 
1621 GAGCTAAGGA AG CTAAAATG GAGAAAAAAA TCACTGGATA TAC CACCGTT GATATATC C C 

16 81 AATGGCATCG TAAAGAACAT TTTGAGG CAT TTCAGTCAGT TGCTCAATGT AC C TAT AAC C 
1741 AGACCGTTCA GCTGGATATT ACGGCCTTTT TAAAGAC CGT AAAGAAAAAT AAGCACAAGT 
18 01 TTTATCCGGC CTTTATTCAC ATTCTTGCCC GCCTGATGAA TGCTCATCCG GAATTCCGTA 
18 61 TGGCAATGAA AGACGGTGAG CTGGTGATAT GGGATAGTGT TCACCCTTGT TACAC CGTTT 
1921 TCCATGAGCA AACTGAAACG TTTTCATCGC TCTGGAGTGA ATAC CACGAC GATTTCCGGC 
1981 AGTTTCTACA CATATATTCG CAAGATGTGG CGTGTTACGG TG AAAAC CTG GCCTATTTCC 
2041 CTAAAGGGTT TATTGAGAAT ATGTTTTTCG TCTCAGCCAA TCCCTGGGTG AGTTTCACCA
2101 GTTTTGATTT AAACGTGGCC AATATGGACA ACTTCTTCGC CCCCGTTTTC ACCATGGGCA 
2161 AATATTATAC GCAAGGCGAC AAGGTGCTGA TGCCGCTGGC GATTCAGGTT CATCATGCCG 
2221 TCTGTGATGG CTTCCATGTC GGCAGAATGC TTAATGAATT ACAACAGTAC TGCGATGAGT 

22 81 GGCAGGGCGG GGCGTAATCT AGAGGATCCG GCTTACTAAA AGC CAGATAA CAGTATGCGT 

23 41 ATTTGCGCGC TGATTTTTGC GGTATAAGAA TATATACTGA TATGTATACC CGAAGTATGT 

24 01 CAAAAAGAGG TGTGCTATGA AGCAGCGTAT TACAGTGACA GTTGACAGCG ACAGCTATCA 
24 61 GTTGCTCAAG GCATATATGA TGTCAATATC TCCGGTCTGG TAAGCACAAC CATGCAGAAT 
2521 GAAGCCCGTC GTCTGCGTGC CGAACGCTGG AAAGCGGAAA ATCAGGAAGG GATGGCTGAG
2581 GTCGCCCGGT TTATTGAAAT GAACGGCTCT TTTGCTGACG AGAACAGGGA CTGGTGAAAT
2 641 G C AG TTTAAG GTTTACACCT ATAAAAGAGA GAGCCGTTAT CGTCTGTTTG TGGATGTACA 
2 701 GAGTGATATT ATTGACACGC CCGGGCGACG G ATGGTG AT C CCCCTGGCCA GTGCACGTCT 
2761 GCTGTCAGAT AAAGTCTCCC GTGAACTTTA CCCGGTGGTG CATATCGGGG ATGAAAGCTG
2 821 GCGCATGATG ACCACCGATA TGGCCAGTGT GCCGGTCTCC GTTATCGGGG AAGAAGTGGC 
2 881 TGATCTCAGC CACCGCGAAA ATGACATCAA AAACGCCATT AACCTGATGT TCTGGGGAAT 

2 941 ATAAATGTCA GGCTCCCTTA TAC AC AG CCA GTCTGCAGGT CGACCATAGT GACTGGATAT 

3 001 GTTGTGTTTT ACAGTATTAT GTAGTCTGTT TTTTATGCAA AATCTAATTT AATATATTGA 
3 061 TATTTATATC ATTTTACGTT TCTCGTTCAG CTTTCTTGTA CAAAGTGGTT TGATGGCCGC 
3121 TAAGTAAGTA AGACGTCGAG CTCTAAGTAA GTAACGGCCG CCACCGCGGT GGAGCTTTGG 
3181 ACTTCTTCGC CAGAGGTTTG GTCAAGTCTC CAATCAAGGT TGTCGGCTTG TCTACCTTGC 
3241 CAGAAATTTA CGAAAAGATG GAAAAGGGTC AAATCGTTGG TAGATACGTT GTTGACACTT 
33 01 CTAAATAAGC GAATTTCTTA TGATTTATGA TTTTTATTAT TAAATAAGTT ATAAAAAAAA 
33 61 TAAGTGTATA CAAATTTTAA AGTGACTCTT AGGTTTTAAA ACGAAAATTC TTATTCTTGA 
3421 GTAACTCTTT CCTGTAGGTC AGGTTGCTTT CTCAGGTATA GCATGAGGTC GCTCTTATTG 
3481 ACCACACCTC TACCGGCATG CCGAGCAAAT GCCTGCAAAT CGCTCCCCAT TTCACCCAAT 
3 541 TGTAGATATG CTAACTCCAG CAATGAGTTG ATGAATCTCG GTGTGTATTT TATGTCCTCA 
36 01 GAGGACAATA CCTGTTGTAA TCGTTCTTCC ACACGGATCC CAATTCGCCC TATAGTGAGT 
36 61 CGTATTACAA TTCACTGGCC GTCGTTTTAC AACGTCGTGA CTGGGAAAAC CCTGGCGTTA 
3 721 CCCAACTTAA TCGCCTTGCA GCACATCCCC CTTTCGCCAG CTGGCGTAAT AGCGAAGAGG 
3 781 CCCGCACCGA TCGCCCTTCC CAACAGTTGC GCAGCCTGAA TGGCGAATGG ACGCGCCCTG 
3 841 TAGCGGCGCA TTAAGCGCGG CGGGTGTGGT GGTTACGCGC AGCGTGACCG CTACACTTGC 
3 9 01 CAGCGCCCTA GCGCCCGCTC CTTTCGCTTT CTTCCCTTCC TTTCTCGCCA CGTTCGCCGG 

3 961 CTTTCCCCGT CAAGCTCTAA ATCGGGGGCT CCCTTTAGGG TTCCGATTTA GTGCTTTACG 

4 021 GCACCTCGAC CCCAAAAAAC TTGATTAGGG TGATGGTTCA CGTAGTGGGC CATCGCCCTG 
4 081 ATAGACGGTT TTTCGCCCTT TGACGTTGGA GTCCACGTTC TTTAATAG TG GACTCTTGTT 
4141 CCAAACTGGA ACAACACTCA ACCCTATCTC GGTCTATTCT TTTGATTTAT AAGGGATTTT 
42 01 GCCGATTTCG GCCTATTGGT TAAAAAATGA GCTGATTTAA CAAAAATTTA ACGCGAATTT 

42 61 TAACAAAATA TTAACGTTTA CAATTTCCTG ATGCGGTATT TTCTCCTTAC GCATCTGTGC 
4 321 GGTATTTCAC ACCGCAGGCA AGTG C AC AAA CAATACTTAA ATAAATACTA CTCAGTAATA 

43 81 ACCTATTTCT TAG C ATTTTT GACGAAATTT GCTATTTTGT TAGAGTCTTT TACACCATTT 
4 441 GTCTCCACAC CTCCGCTTAC ATCAACACCA ATAAC G C CAT TTAATCTAAG CGCATCACCA 
4 501 ACATTTTCTG GCGTCAGTCC ACCAGCTAAC ATAAAATGTA AGCTTTCGGG GCTCTCTTGC 
45 61 CTTCCAACCC AGTCAGAAAT CGAGTTCCAA TCCAAAAGTT CACCTGTCCC ACCTGCTTCT 
4 621 GAATCAAACA AGGGAATAAA CGAATGAGGT TTCTGTGAAG CTGCACTGAG TAGTATGTTG 
4681 CAGTCTTTTG G AAATAC GAG TCTTTTAATA ACTGG C AAAC CGAGGAACTC TTGGTATTCT 
4 741 TGCCACGACT CATCTCCATG CAGTTGGACG ATATCAATGC CGTAATCATT GACCAGAGCC 
4 8 01 AAAACATCCT CCTTAGGTTG ATTACGAAAC ACGCCAACCA AGTATTTCGG AGTGCCTGAA 
4861 CTATTTTTAT ATGCTTTTAC AAGACTTGAA ATTTTCCTTG CAATAACCGG GTCAATTGTT 
4 921 CTCTTTCTAT TGGGCACACA TATAATACCC AGCAAGTCAG CATCGGAATC TAGAGCACAT 

4 981 TCTGCGGCCT CTGTGCTCTG CAAGCCGCAA ACTTTCACCA ATGGAC C AG A ACTAC CTGTG 
5041 AAATTAATAA CAGACATACT CCAAGCTGCC TTTGTGTGCT TAATCAC GTA TACTCACGTG 
5101 CTCAATAGTC ACCAATGCCC TCCCTCTTGG CCCTCTCCTT TTCTTTTTTC GACCGAATTA 
5161 ATTCTTAATC GGCAAAAAAA GAAAAGCTCC GGATCAAGAT TGTACGTAAG GTGACAAGCT 
5221 ATTTTTCAAT AAAGAATATC TTCCACTACT GCCATCTGGC GTCATAACTG CAAAGTACAC 
5281 ATATATTACG ATGCTGTCTA TTAAATGCTT CCTATATTAT ATATATAGTA ATGTCGTTTA

53 41 TGGTGCACTC TCAGTACAAT CTGCTCTGAT GCCGCATAGT TAAGCCAGCC CCGACACCCG 

54 01 CCAACACCCG CTGACGCGCC CTGACGGGCT TGTCTGCTCC CGGCATCCGC TTACAGACAA 
54 61 GCTGTGACCG TCTCCGGGAG CTGCATGTGT CAGAGGTTTT CACCGTCATC ACCGAAACGC 
5521 GCGAGACGAA AGGGCCTCGT GATACGCCTA TTTTTATAGG TTAATGTCAT GATAATAATG 

5 581 GTTTCTTAGG ACGGATCGCT TGCCTGTAAC TTACACGCGC CTCGTATCTT TTAATGATGG 

56 41 AATAATTTGG GAATTTACTC TGTGTTTATT TATTTTTATG TTTTGTATTT GGATTTTAGA 

57 01 AAGTAAATAA AGAAGGTAGA AGAGTTACGG AATGAAGAAA AAAAAATAAA CAAAGGTTTA 

57 61 AAAAATTTCA AC AAAAAG C G TACTTTACAT ATATATTTAT TAGACAAGAA AAGCAGATTA 
5 821 AATAGATATA CATTCGATTA ACGATAAGTA AAATGTAAAA TCACAGGATT TTCGTGTGTG 

58 81 GTCTTCTACA CAGACAAGAT GAAACAATTC GGCATTAATA CCTGAGAGCA GGAAGAGCAA 

5 941 GATAAAAGGT AGTATTTGTT GGCGATCCCC CTAGAGTCTT TTACATCTTC GGAAAACAAA 

6001 AACTATTTTT TCTTTAATTT CTTTTTTTAC TTTCTATTTT TAATTTATAT ATTTATATTA
6061 AAAAATTTAA ATTATAATTA TTTTTATAGC ACGTGATGAA AAGGACCCAG GTGGCACTTT
6121 TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA 
6181 TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT 
6241 GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT 
6 3 01 TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG 
6 361 AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA 
6 421 AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG T ATT AT C C C G 
6 481 TATTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT 
6 541 TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG 
66 01 CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG 
6661 AGGAC CGAAG GAGCTAACCG CTTTTTTTCA CAACATGGGG GATCATGTAA CTCGCCTTGA 
6 721 TCGTTGGGAA CCGGAGCTGA ATGAAG CC AT ACCAAACGAC GAGCGTGACA CCACGATGCC 
6781 TGTAG C AATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC 
6 841 CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC 
6 9 01 GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG 

6 961 CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC 

7 021 GACGGGCAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC 
70 81 AC TG ATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT 
7141 AAAACTTCAT TTTTAAT TT A AAAGGAT CTA GGTGAAGATC CTTTTTGATA ATCTCATGAC 
72 01 CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA 

72 61 AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC 

73 21 ACCGCTACCA GCGGTGGTTT GTTTGCCGGA T CAAGAG CTA CCAACTCTTT TTCCGAAGGT 
7 381 AACTGGCTTC AGCAGAGCGC AG ATAC C AAA TACTGTC CTT CTAGTGTAGC CGTAGTTAGG 
7441 CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC 
75 01 AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT 
7561 ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TG C AC AC AG C CCAGCTTGGA 
7 621 GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CATTGAGAAA GCGCCACGCT 
7681 TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG 
7741 C AC GAGGGAG CTTCCAGGGG GGAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA 
78 01 CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCCGAGCC TATGGAAAAA 
78 61 CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT 
7 921 CTTTC CTGCG TTATCCCCTG ATTCTGTGGA TAAC CGTATT ACCGCCTTTG AGTGAGCTGA 

7 981 TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA 

8 041 GCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA 
8101 CGACAGGTTT CCCGACTGGA AAGCGGGCAG TGAGCGCAAC GCAATTAATG TGAGTTACCT 
8161 CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCCTATGT TGTGTGGAAT 
8 221 TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC CATGATTACG CCAAGCTCGG 
82 81 AATTAAC C CT CACTAAAGGG AACAAAAGCT GGGTACCGGG CCCCCCCTCG AGATCCGGGA 
8 3 41 TCGAAGAAAT GATGGTAAAT GAAATAGGAA ATCAAGGAGC ATGAAGGCAA AAGACAAATA 
84 01 TAAGGGTCGA ACGAAAAATA AAGTGAAAAG TGTTGATATG ATGTATTTGG CTTTGCGGCG 
84 61 CCGAAAAAAC GAGTTTACGC AATTGCACAA TCATGCTGAC TCTGTGGCGG ACCCGCGCTC 
8 521 TTGCCGGCCC GGCGATAACG CTGGGCGTGA GGCTGTGCCC GGCGGAGTTT TTTGCGCCTG 
8 581 CATTTTCCAA GGTTTACCCT GCGCTAAGGG GCGAGATTGG AGAAGCAATA AGAATGCCGG 
8 641 TTGGGGTTGC GATGATGACG ACCACGACAA CTGGTGTCAT TATTTAAGTT GCCGAAAGAA 
8 701 CCTGAGTGCA TTTGCAACAT GAGTATACTA GAAGAATGAG CCAAGACTTG CGAGACGCGA 

8761 GTTTGCCGGT GGTGCGAACA ATAGAGCGAC CATGACCTTG AAGGTGAGAC GCGCATAACC
8 821 GCTAGAGTAC TTTGAAGAGG AAACAGCAAT AGGGTTGCTA CCAGTATAAA TAGACAGGTA 
8 881 CATACAACAC TGGAAATGGT TGTCTGTTTG AGTACG CTTT CAA 



WO 00/52027 PCT/US00/05432




Figure: His6 carboxy-fusion vector, T7 promoter
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pDEST23 6264 bp 



Location (Base Nos.)    Gene Encoded
285..161                attR1
394..1053               CmR
1173..1257              inactivated ccdA
1395..1700              ccdB
1741..1865              attR2
1883..1911              his6
2574..3434              ampR
3583..4222              ori
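The locations in the table above are 1-based, inclusive base numbers. As a minimal sketch in Python, assuming that a location written with the larger number first (such as 285..161 for attR1) denotes a feature read from the complementary strand, a feature could be pulled out of a cleaned sequence string as shown here. The function names and the toy sequence are hypothetical illustrations and are not part of the original disclosure; features that wrap around the origin of a circular vector are deliberately not handled.

```python
# Hypothetical helpers for reading a feature out of a plain sequence string.
# Assumption: "start..end" with start > end means the complementary strand.

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_feature(seq: str, start: int, end: int) -> str:
    """Extract a feature given 1-based, inclusive base numbers.

    If start <= end the bases are returned as written; otherwise the span
    end..start is taken and reverse-complemented.
    """
    if start <= end:
        return seq[start - 1:end]
    return reverse_complement(seq[end - 1:start])


def feature_length(start: int, end: int) -> int:
    """Length in bases of a non-wrapping feature."""
    return abs(end - start) + 1


toy = "ATGCCCGGGTAA"                 # illustrative sequence only
forward = extract_feature(toy, 1, 3)  # bases 1..3 as written
reverse = extract_feature(toy, 3, 1)  # same span read from the other strand
attr1_len = feature_length(285, 161)  # length implied by the attR1 location
```

Treating a descending range as a complement is a simplification; a location such as a promoter that spans the end and the start of a circular sequence would need separate handling.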



1 TCTTCCCCAT CGGTGATGTC GGCGATATAG GCGCCAGCAA CCGCACCTGT GGCGCCGGTG
61 ATGCCGGCCA CGATGCGTCC GGCGTAGAGG ATCGAGATCT CGATCCCGCG AAATTAATAC
121 GACTCACTAT AGGGAGACCA CAACGGTTTC CCTCTAGATC ACAAGTTTGT ACAAAAAAGC
181 TGAACGAGAA ACGTAAAATG ATATAAATAT CAATATATTA AATTAGATTT TGCATAAAAA
241 ACAGACTACA TAATACTGTA AAACACAACA TATCCAGTCA CTATGGCGGC CGCATTAGGC
301 ACCCCAGGCT TTACACTTTA TGCTTCCGGC TCGTATAATG TGTGGATTTT GAGTTAGGAT
361 CCGGCGAGAT TTTCAGGAGC TAAGGAAGCT AAAATGGAGA AAAAAATCAC TGGATATACC
421 ACCGTTGATA TATCCCAATG GCATCGTAAA GAACATTTTG AGGCATTTCA GTCAGTTGCT
481 CAATGTACCT ATAACCAGAC CGTTCAGCTG GATATTACGG CCTTTTTAAA GACCGTAAAG
541 AAAAATAAGC ACAAGTTTTA TCCGGCCTTT ATTCACATTC TTGCCCGCCT GATGAATGCT
601 CATCCGGAAT TCCGTATGGC AATGAAAGAC GGTGAGCTGG TGATATGGGA TAGTGTTCAC
661 CCTTGTTACA CCGTTTTCCA TGAGCAAACT GAAACGTTTT CATCGCTCTG GAGTGAATAC
721 CACGACGATT TCCGGCAGTT TCTACACATA TATTCGCAAG ATGTGGCGTG TTACGGTGAA
781 AACCTGGCCT ATTTCCCTAA AGGGTTTATT GAGAATATGT TTTTCGTCTC AGCCAATCCC
841 TGGGTGAGTT TCACCAGTTT TGATTTAAAC GTGGCCAATA TGGACAACTT CTTCGCCCCC
901 GTTTTCACCA TGGGCAAATA TTATACGCAA GGCGACAAGG TGCTGATGCC GCTGGCGATT
961 CAGGTTCATC ATGCCGTCTG TGATGGCTTC CATGTCGGCA GAATGCTTAA TGAATTACAA
1021 CAGTACTGCG ATGAGTGGCA GGGCGGGGCG TAAACGCGTG GATCCGGCTT ACTAAAAGCC
1081 AGATAACAGT ATGCGTATTT GCGCGCTGAT TTTTGCGGTA TAAGAATATA TACTGATATG
1141 TATACCCGAA GTATGTCAAA AAGAGGTGTG CTATGAAGCA GCGTATTACA GTGACAGTTG
1201 ACAGCGACAG CTATCAGTTG CTCAAGGCAT ATATGATGTC AATATCTCCG GTCTGGTAAG
1261 CACAACCATG CAGAATGAAG CCCGTCGTCT GCGTGCCGAA CGCTGGAAAG CGGAAAATCA
1321 GGAAGGGATG GCTGAGGTCG CCCGGTTTAT TGAAATGAAC GGCTCTTTTG CTGACGAGAA
1381 CAGGGACTGG TGAAATGCAG TTTAAGGTTT ACACCTATAA AAGAGAGAGC CGTTATCGTC
1441 TGTTTGTGGA TGTACAGAGT GATATTATTG ACACGCCCGG GCGACGGATG GTGATCCCCC
1501 TGGCCAGTGC ACGTCTGCTG TCAGATAAAG TCTCCCGTGA ACTTTACCCG GTGGTGCATA
1561 TCGGGGATGA AAGCTGGCGC ATGATGACCA CCGATATGGC CAGTGTGCCG GTCTCCGTTA
1621 TCGGGGAAGA AGTGGCTGAT CTCAGCCACC GCGAAAATGA CATCAAAAAC GCCATTAACC
1681 TGATGTTCTG GGGAATATAA ATGTCAGGCT CCCTTATACA CAGCCAGTCT GCAGGTCGAC
1741 CATAGTGACT GGATATGTTG TGTTTTACAG TATTATGTAG TCTGTTTTTT ATGCAAAATC
1801 TAATTTAATA TATTGATATT TATATCATTT TACGTTTCTC GTTCAGCTTT CTTGTACAAA
1861 GTGGTGATTA TGTCGTACTA CCATCACCAT CACCATCACC TCGATGAGCA ATAACTAGCA
1921 TAACCCCTTG GGGCCTCTAA ACGGGTCTTG AGGGGTTTTT TGCTGAAAGG AGGAACTATA
1981 TCCGGATATC CACAGGACGG GTGTGGTCGC CATGATCGCG TAGTCGATAG TGGCTCCAAG
2041 TAGCGAAGCG AGCAGGACTG GGCGGCGGCC AAAGCGGTCG GACAGTGCTC CGAGAACGGG
2101 TGCGCATAGA AATTGCATCA ACGCATATAG CGCTAGCAGC ACGCCATAGT GACTGGCGAT
2161 GCTGTCGGAA TGGACGATAT CCCGCAAGAG GCCCGGCAGT ACCGGCATAA CCAAGCCTAT
2221 GCCTACAGCA TCCAGGGTGA CGGTGCCGAG GATGACGATG AGCGCATTGT TAGATTTCAT
2281 ACACGGTGCC TGACTGCGTT AGCAATTTAA CTGTGATAAA CTACCGCATT AAAGCTTATC
2341 GATGATAAGC TGTCAAACAT GAGAATTCTT GAAGACGAAA GGGCCTCGTG ATACGCCTAT
2401 TTTTATAGGT TAATGTCATG ATAATAATGG TTTCTTAGAC GTCAGGTGGC ACTTTTCGGG
2461 GAAATGTGCG CGGAACCCCT ATTTGTTTAT TTTTCTAAAT ACATTCAAAT ATGTATCCGC
2521 TCATGAGACA ATAACCCTGA TAAATGCTTC AATAATATTG AAAAAGGAAG AGTATGAGTA
2581 TTCAACATTT CCGTGTCGCC CTTATTCCCT TTTTTGCGGC ATTTTGCCTT CCTGTTTTTG
2641 CTCACCCAGA AACGCTGGTG AAAGTAAAAG ATGCTGAAGA TCAGTTGGGT GCACGAGTGG
2701 GTTACATCGA ACTGGATCTC AACAGCGGTA AGATCCTTGA GAGTTTTCGC CCCGAAGAAC
2761 GTTTTCCAAT GATGAGCACT TTTAAAGTTC TGCTATGTGG CGCGGTATTA TCCCGTGTTG
2821 ACGCCGGGCA AGAGCAACTC GGTCGCCGCA TACACTATTC TCAGAATGAC TTGGTTGAGT
2881 ACTCACCAGT CACAGAAAAG CATCTTACGG ATGGCATGAC AGTAAGAGAA TTATGCAGTG
2941 CTGCCATAAC CATGAGTGAT AACACTGCGG CCAACTTACT TCTGACAACG ATCGGAGGAC
3001 CGAAGGAGCT AACCGCTTTT TTGCACAACA TGGGGGATCA TGTAACTCGC CTTGATCGTT
3061 GGGAACCGGA GCTGAATGAA GCCATACCAA ACGACGAGCG TGACACCACG ATGCCTGCAG
3121 CAATGGCAAC AACGTTGCGC AAACTATTAA CTGGCGAACT ACTTACTCTA GCTTCCCGGC
3181 AACAATTAAT AGACTGGATG GAGGCGGATA AAGTTGCAGG ACCACTTCTG CGCTCGGCCC
3241 TTCCGGCTGG CTGGTTTATT GCTGATAAAT CTGGAGCCGG TGAGCGTGGG TCTCGCGGTA
3301 TCATTGCAGC ACTGGGGCCA GATGGTAAGC CCTCCCGTAT CGTAGTTATC TACACGACGG
3361 GGAGTCAGGC AACTATGGAT GAACGAAATA GACAGATCGC TGAGATAGGT GCCTCACTGA
3421 TTAAGCATTG GTAACTGTCA GACCAAGTTT ACTCATATAT ACTTTAGATT GATTTAAAAC
3481 TTCATTTTTA ATTTAAAAGG ATCTAGGTGA AGATCCTTTT TGATAATCTC ATGACCAAAA
3541 TCCCTTAACG TGAGTTTTCG TTCCACTGAG CGTCAGACCC CGTAGAAAAG ATCAAAGGAT
3601 CTTCTTGAGA TCCTTTTTTT CTGCGCGTAA TCTGCTGCTT GCAAACAAAA AAACCACCGC
3661 TACCAGCGGT GGTTTGTTTG CCGGATCAAG AGCTACCAAC TCTTTTTCCG AAGGTAACTG
3721 GCTTCAGCAG AGCGCAGATA CCAAATACTG TCCTTCTAGT GTAGCCGTAG TTAGGCCACC
3781 ACTTCAAGAA CTCTGTAGCA CCGCCTACAT ACCTCGCTCT GCTAATCCTG TTACCAGTGG
3841 CTGCTGCCAG TGGCGATAAG TCGTGTCTTA CCGGGTTGGA CTCAAGACGA TAGTTACCGG
3901 ATAAGGCGCA GCGGTCGGGC TGAACGGGGG GTTCGTGCAC ACAGCCCAGC TTGGAGCGAA
3961 CGACCTACAC CGAACTGAGA TACCTACAGC GTGAGCTATG AGAAAGCGCC ACGCTTCCCG
4021 AAGGGAGAAA GGCGGACAGG TATCCGGTAA GCGGCAGGGT CGGAACAGGA GAGCGCACGA
4081 GGGAGCTTCC AGGGGGAAAC GCCTGGTATC TTTATAGTCC TGTCGGGTTT CGCCACCTCT
4141 GACTTGAGCG TCGATTTTTG TGATGCTCGT CAGGGGGGCG GAGCCTATGG AAAAACGCCA
4201 GCAACGCGGC CTTTTTACGG TTCCTGGCCT TTTGCTGGCC TTTTGCTCAC ATGTTCTTTC
4261 CTGCGTTATC CCCTGATTCT GTGGATAACC GTATTACCGC CTTTGAGTGA GCTGATACCG
4321 CTCGCCGCAG CCGAACGACC GAGCGCAGCG AGTCAGTGAG CGAGGAAGCG GAAGAGCGCC
4381 TGATGCGGTA TTTTCTCCTT ACGCATCTGT GCGGTATTTC ACACCGCATA TATGGTGCAC
4441 TCTCAGTACA ATCTGCTCTG ATGCCGCATA GTTAAGCCAG TATACACTCC GCTATCGCTA
4501 CGTGACTGGG TCATGGCTGC GCCCCGACAC CCGCCAACAC CCGCTGACGC GCCCTGACGG
4561 GCTTGTCTGC TCCCGGCATC CGCTTACAGA CAAGCTGTGA CCGTCTCCGG GAGCTGCATG
4621 TGTCAGAGGT TTTCACCGTC ATCACCGAAA CGCGCGAGGC AGCTGCGGTA AAGCTCATCA
4681 GCGTGGTCGT GAAGCGATTC ACAGATGTCT GCCTGTTCAT CCGCGTCCAG CTCGTTGAGT
4741 TTCTCCAGAA GCGTTAATGT CTGGCTTCTG ATAAAGCGGG CCATGTTAAG GGCGGTTTTT
4801 TCCTGTTTGG TCACTGATGC CTCCGTGTAA GGGGGATTTC TGTTCATGGG GGTAATGATA
4861 CCGATGAAAC GAGAGAGGAT GCTCACGATA CGGGTTACTG ATGATGAACA TGCCCGGTTA
4921 CTGGAACGTT GTGAGGGTAA ACAACTGGCG GTATGGATGC GGCGGGACCA GAGAAAAATC
4981 ACTCAGGGTC AATGCCAGCG CTTCGTTAAT ACAGATGTAG GTGTTCCACA GGGTAGCCAG
5041 CAGCATCCTG CGATGCAGAT CCGGAACATA ATGGTGCAGG GCGCTGACTT CCGCGTTTCC
5101 AGACTTTACG AAACACGGAA ACCGAAGACC ATTCATGTTG TTGCTCAGGT CGCAGACGTT
5161 TTGCAGCAGC AGTCGCTTCA CGTTCGCTCG CGTATCGGTG ATTCATTCTG CTAACCAGTA
5221 AGGCAACCCC GCCAGCCTAG CCGGGTCCTC AACGACAGGA GCACGATCAT GCGCACCCGT
5281 GGCCAGGACC CAACGCTGCC CGAGATGCGC CGCGTGCGGC TGCTGGAGAT GGCGGACGCG
5341 ATGGATATGT TCTGCCAAGG GTTGGTTTGC GCATTCACAG TTCTCCGCAA GAATTGATTG
5401 GCTCCAATTC TTGGAGTGGT GAATCCGTTA GCGAGGTGCC GCCGGCTTCC ATTCAGGTCG
5461 AGGTGGCCCG GCTCCATGCA CCGCGACGCA ACGCGGGGAG GCAGACAAGG TATAGGGCGG
5521 CGCCTACAAT CCATGCCAAC CCGTTCCATG TGCTCGCCGA GGCGGCATAA ATCGCCGTGA
5581 CGATCAGCGG TCCAGTGATC GAAGTTAGGC TGGTAAGAGC CGCGAGCGAT CCTTGAAGCT
5641 GTCCCTGATG GTCGTCATCT ACCTGCCTGG ACAGCATGGC CTGCAACGCG GGCATCCCGA
5701 TGCCGCCGGA AGCGAGAAGA ATCATAATGG GGAAGGCCAT CCAGCCTCGC GTCGCGAACG
5761 CCAGCAAGAC GTAGCCCAGC GCGTCGGCCG CCATGCCGGC GATAATGGCC TGCTTCTCGC
5821 CGAAACGTTT GGTGGCGGGA CCAGTGACGA AGGCTTGAGC GAGGGCGTGC AAGATTCCGA
5881 ATACCGCAAG CGACAGGCCG ATCATCGTCG CGCTCCAGCG AAAGCGGTCC TCGCCGAAAA
5941 TGACCCAGAG CGCTGCCGGC ACCTGTCCTA CGAGTTGCAT GATAAAGAAG ACAGTCATAA
6001 GTGCGGCGAC GATAGTCATG CCCCGCGCCC ACCGGAAGGA GCTGACTGGG TTGAAGGCTC
6061 TCAAGGGCAT CGGTCGATCG ACGCTCTCCC TTATGCGACT CCTGCATTAG GAAGCAGCCC
6121 AGTAGTAGGT TGAGGCCGTT GAGCACCGCC GCCGCAAGGA ATGGTGCATG CAAGGAGATG
6181 GCGCCCAACA GTCCCCCGGC CACGGGGCCT GCCACCATAC CCACGCCGAA ACAAGCGCTC
6241 ATGAGCCCGA AGTGGCGAGC CCGA

GST carboxy-fusion vector, T7 promoter



[Figure: annotated double-stranded sequence of pDEST24 showing the T7 promoter, the attR1 and attR2 junctions, and the start of the GST coding region. EcoRI and AatII sites are labeled. The base-level detail is not legible in this copy; the complete sequence is listed below.]

pDEST24 6961 bp

Location (Base Nos.)    Gene Encoded
195..71                 attR1
304..963                CmR
1083..1167              inactivated ccdA
1305..1610              ccdB
1651..1775              attR2
1783..2451              GST
3181..4041              ampR
4190..4829              ori



1 ATCGAGATCT CGATCCCGCG AAATTAATAC 
61 CCTCTAGATC ACAAGTTTGT ACAAAAAAGC 
121 CAATATATTA AATTAGATTT TGCATAAAAA 
181 TATCCAGTCA CTATGGCGGC CGCATTAGGC 
241 TCGTATAATG TGTGGATTTT GAGTTAGGAT 
3 01 AAAATGGAGA AAAAAATCAC T GGATAT AC C 
361 GAACATTTTG AGG CATTTC A GTCAGTTGCT 
421 GATATTACGG CCTTTTTAAA GACCGTAAAG 
481 ATTCACATTC TTGCCCGCCT GATGAATGCT 
541 GGTGAGCTGG TGATATGGGA TAGTGTTCAC 
6 01 GAAACGTTTT CATCGCTCTG GAG TG AATAC 
661 TATTCGCAAG ATGTGGCGTG TTACGGTGAA 
721 GAGAATATGT TTTTCGTCTC AGCCAATCCC 
781 GTGGCCAATA TGGACAACTT CTTCGCCCCC 
841 GGCGACAAGG TGCTGATGCC GCTGGCGATT 
901 CATGTCGGCA GAATGCTTAA TGAATTACAA 
961 TAAACGCGTG GATCCGGCTT ACTAAAAGCC 
1021 TTTTGCGGTA TAAGAATATA T AC TG ATATG 
1081 CTATGAAG C A GCGTATTACA GTGACAGTTG 
1141 ATATGATGTC AATATCTCCG GTCTGGTAAG 
12 01 GCGTGCCGAA CGCTGGAAAG CGGAAAATCA 

12 61 TGAAATGAAC GGCTCTTTTG CTGACGAGAA 
1321 ACACCTATAA AAG AG AG AG C CGTTATCGTC 

13 81 ACACGCCCGG GCGACGGATG GTGATCCCCC 
1441 TCTCCCGTGA ACTTTACCCG GTGGTGCATA 

15 01 CCGATATGGC CAGTGTGCCG GTCTCCGTTA 
1561 GCGAAAATGA CATC AAAAAC GCCATTAACC 
1621 C C C T TAT AC A CAGCCAGTCT GCAGGTCGAC 

16 81 TATTATGTAG TCTGTTTTTT ATGCAAAATC 
1741 TACGTTTCTC GTTCAGCTTT CTTGTACAAA 
18 01 TGGAAAATTA AGGGCCTTGT GCAACCCACT 
18 61 TATGAAGAGC ATTTGTATGA GCGCGATGAA 
1921 TTGGGTTTGG AGTTTCCCAA TCTTCCTTAT 
1,981 TCTATGGCCA T CAT AC G TT A TATAGCTGAC 
2 041 GAGCGTGCAG AGATTTCAAT GCTTGAAGGA 
2101 AGAATTGCAT ATAGTAAAGA CTTTGAAACT 
2161 GAAATGCTGA AAATGTTCGA AGATCGTTTA 
2 221 GTAACCCATC CTGACTTCAT GTTGTATGAC 
2 281 ATGTGCCTGG ATGCGTTCCC AAAATTAGTT 
2 341 CAAATTGATA AGTACTTGAA AT C CAGCAAG 
2 4 01 GCCACGTTTG GTGGTGGCGA CCATCCTCCA 
2 461 TCCGGCTGCT AACAAAGCCC GAAAGGAAGC 
2 521 ACTAGCATAA CCCCTTGGGG CCTCTAAACG 
2 581 AAC T AT AT C C G GAT AT C C AC AGGACGGGTG 
2 641 CTCCAAGTAG CGAAGCGAGC AGGACTGGGC 



GACTCACTAT AGGGAGACCA CAACGGTTTC 

TGAACGAGAA ACGTAAAATG ATATAAATAT 

ACAGACTACA TAATACTGTA AAACACAACA 

ACCCCAGGCT TTACACTTTA TGCTTCCGGC 

CCGGCGAGAT TTTCAGGAGC TAAGGAAGCT 

ACCGTTGATA TATCCCAATG GCATCGTAAA 

CAATGTACCT AT AAC C AG AC CGTTCAGCTG 

AAAAATAAGC ACAAGTTTTA TCCGGCCTTT 

CATCCGGAAT TCCGTATGGC AATGAAAGAC 

C CTTGTT AC A CCGTTTTCCA TGAGCAAACT 

CACGACGATT TCCGGCAGTT TCTACACATA 

AACCTGGCCT ATTTCCCTAA AGGGTTTATT 

TGGGTGAGTT TCACCAGTTT TGATTTAAAC 

GTTTTCACCA TGGGCAAATA TTATACG CAA 

C AGG TT C AT C ATGCCGTCTG TGATGGCTTC 

CAGTACTGCG ATGAGTGGCA GGGCGGGGCG 

AGATAACAGT ATGCGTATTT GCGCGCTGAT 

TATACC CGAA GTATGTCAAA AAGAGGTGTG 

ACAGCGACAG CTATCAGTTG CTCAAGGCAT 

CACAACCATG CAGAATGAAG CCCGTCGTCT 

GGAAGGGATG GCTGAGGTCG CCCGGTTTAT 

CAGGGACTGG TGAAATGCAG TTTAAGGTTT 

TGTTTGTGGA TGTACAGAGT GAT AT TAT TG 

TGGCCAGTGC ACGTCTGCTG T C AG AT AAAG 

TCGGGGATGA AAGCTGGCGC ATGATGACCA 

TCGGGGAAGA AGTGGCTGAT CTCAGCCACC 

TGATGTTCTG GGGAATATAA ATGTCAGGCT 

CATAGTGACT GGATAT GTTG TGTTTTACAG 

TAATTTAATA TATTGATATT TATATCATTT 

GTGGTGATTA TGTCCCCTAT AC T AGGT T AT 
CGACTTCTTT TGGAATATCT TGAAGAAAAA 

GGTGATAAAT GGCGAAACAA AAAGTTTGAA 

TAT AT TG ATG GTGATGTTAA AT T AAC AC AG 
AAGCACAACA TGTTGGGTGG TTGTCCAAAA 
GCGGTTTTGG AT AT TAG AT A CGGTGTTTCG 
CTCAAAGTTG ATTTTCTTAG CAAGCTACCT 
TGTCATAAAA CAT AT TT AAA TGGTGATCAT 
GCTCTTGATG TTGTTTTATA CATGGACCCA 
TGTTTTAAAA AACGTATTGA AGCTATCCCA 
TATATAGCAT GGCCTTTGCA GGGCTGGCAA 
AAAT CGGATC TGGTTCCGCG TCCATGGGGA 
TGAGTTGGCT GCTGCCACCG CTGAGCAATA 
GGTCTTGAGG GGTTTTTTGC TGAAAGGAGG 
TGGTCGCCAT GATCGCGTAG TCGATAGTGG 
GGCGGCCAAA GCGGTCGGAC AGTGCTCCGA — 
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27 01 GAACGGGTGC 

2 761 TGGCGATGCT 

2 8 21 AGCCTATGCC 

2 881 ATTTCATACA 

2 941 GCTTATCGAT 

3 0 01 CGCCTATTTT 
3 061 TTTCGGGGAA 
3121 TATCCGCTCA 
3181 ATGAGTATTC 
3241 GTTTTTGCTC 
3 3 01 CGAGTGGGTT 

33 61 GAAGAACGTT 
3421 CGTGTTGACG 

34 81 GTTGAGTACT 
3 541 TGCAGTGCTG 
3 601 GGAGGACCGA 
3 661 GATCGTTGGG 
3 721 CCTGCAGCAA 
3 781 TCCCGGCAAC 
3 841 TCGGCCCTTC 
3 901 CGCGGTATCA 

3 961 ACGACGGGGA 

4 021 T C AC TGATTA 
4 081 TTAAAACTTC 
4141 AC C AAAATC C 

42 01 AAAGGATCTT 
4261 CCACCGCTAC 
4321 GTAACTGGCT 

43 81 GGCCACCACT 
4441 CCAGTGGCTG 
45 01 TTACCGGATA 
45 61 GAGCGAACGA 
4 621 CTTCCCGAAG 
4681 CGCACGAGGG 

4 741 CACCTCTGAC 
48 01 AACGCCAGCA 
4861 TTCTTTCCTG 
4921 GATACCGCTC 
4981 GAGCGCCTGA 

5 041 GGTGCACTCT 
5101 ATCGCTACGT 
5161 CTGACGGGCT 
5221 CTGCATGTGT 
5281 CTCATCAGCG 
5341 GTTGAGTTTC 
'54 01 GGTTTTTTCC 
54 61 AATGATACCG 
5521 CCGGTTACTG 
5581 , AAAAATCACT 
5641 TAGCC AG CAG 
57 01 CGTTTCCAGA 
57 61 AGACGTTTTG 
5821 ACCAGTAAGG 
5881 CACCCGTGGC 
5941 GGACGCGATG 

6 0 01 TTGATTGGCT 
6 061 CAGGTCGAGG 
6121 AGGGCGGCGC 
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GCATAGAAAT 
GTCGGAATGG 
T AC AG CAT C C 
CGGTGCCTGA 
GATAAGCTGT 
TATAGGTTAA 
ATGTGCGCGG 
TGAGACAATA 
AACATTTCCG 
ACCCAGAAAC 
ACATCGAACT 
TTCCAATGAT 
CCGGGCAAGA 
CACCAGTCAC 
CC ATAAC CAT 
AGGAGCTAAC 
AACCGGAGCT 
TGGCAACAAC 
AATTAATAGA 
CGGCTGGCTG 
TTGCAGCACT 
GTCAGGCAAC 
AGCATTGGTA 
ATTTTTAATT 
CTTAACGTGA 
CTTGAGATCC 
CAGCGGTGGT 
TCAGCAGAGC 
TCAAGAACTC 
CTGCCAGTGG 
AGGCGCAGCG 
CCTACACCGA 
GGAGAAAGGC 
AGCTTCCAGG 
TTGAGCGTCG 
ACGCGGCCTT 
CGTTATCCCC 
GCCGCAGCCG 
TGCGGTATTT 
CAGTACAATC 
GACTGGGTCA 
TGTCTGCTCC 
CAGAGGTTTT 
TGGTCGTGAA 
TCCAGAAGCG 
TGTTTGGTCA 
ATGAAACGAG 
GAACGTTGTG 
CAGGGTCAAT 
CATCCTGCGA 
CTTTACGAAA 
CAGCAGCAGT 
CAACCCCGCC 
CAGGACCCAA 
GATATGTTCT 
CCAATTCTTG 
TGGCCCGGCT 
CTACAATCCA 



TG CAT C AACG 
ACGATATCCC 
AGGGTGACGG 
CTGCGTTAGC 
CAAACATGAG 
TGTCATGATA 
AACCCCTATT 
AC CCTGATAA 
TGTCGCCCTT 
GCTGGTGAAA 
GGATCTCAAC 
GAGCACTTTT 
GCAACTCGGT 
AGAAAAGCAT 
GAGTGATAAC 
CGCTTTTTTG 
G AATGAAG C C 
GTTGCGCAAA 
CTGGATGGAG 
GTTTATTGCT 
GGGGCCAGAT 
TATGGATGAA 
ACTGTCAGAC 
TAAAAGGATC 
GTTTTCGTTC 
TTTTTTTCTG 
TTGTTTGCCG 
G CAG AT AC C A 
TGTAGCACCG 
CGATAAGTCG 
GTCGGGCTGA 
ACTGAGATAC 
GGACAGGTAT 
GGGAAACGCC 
ATTTT TGTG A 
TTTACGGTTC 
TGATTCTGTG 
AACGACCGAG 
TCTCCTTACG 
TGCTCTGATG 
TGGCTGCGCC 
CGGCATCCGC 
CACCGTCATC 
GCGATTCACA 
TTAATGTCTG 
CTGATGCCTC 
AGAGGATGCT 
AGGGTAAACA 
GCCAGCGCTT 
TGCAGATCCG 
CACGGAAACC 
CGCTTCACGT 
AGCCTAGCCG 
CGCTGCCCGA 
GCCAAGGGTT 
GAGTGGTGAA 
CCATGCACCG 
TGCCAACCCG 



CAT AT AG CG C 
GCAAGAGGCC 
TGCCGAGGAT 
AATTTAACTG 
AATTCTTGAA 
ATAATGGTTT 
TGTTTATTTT 
ATGCTTCAAT 
ATTCCCTTTT 
GTAAAAGATG 
AGCGGTAAGA 
AAAGTTCTGC 
CGCCGCATAC 
CTTACGGATG 
ACTGCGGCCA 
CACAACATGG 
ATACCAAACG 
CTATTAACTG 
GCGGATAAAG 
G ATAAATC TG 
GGTAAGCCCT 
CGAAATAGAC 
CAAGTTTACT 
TAGGTGAAGA 
CACTGAGCGT 
CGCGTAATCT 
GATCAAGAGC 
AATACTGTCC 
CCTACATACC 
TGTCTTACCG 
ACGGGGGGTT 
CTACAGCGTG 
CCGGTAAGCG 
TGGTATCTTT 
TGCTCGTCAG 
CTGGCCTTTT 
GATAACCGTA 
CGCAGCGAGT 
CATCTGTGCG 
CCGCATAGTT 
CCGACACCCG 
TTACAGACAA 
ACCGAAACGC 
GATGTCTGCC 
GCTTCTGATA 
CGTGTAAGGG 
CACGATACGG 
ACTGGCGGTA 
CGTTAATACA 
GAACATAATG 
GAAGACCATT 
TCGCTCGCGT 
GGTCCTCAAC 
GATGCGCCGC 
GGTTTGCGCA 
TCCGTTAGCG 
CG ACG C AACG 
TTCCATGTGC 



TAGCAGCACG 
CGGCAGTACC 
GACGATGAGC 
TGATAAACTA 
GACGAAAGGG 
CTTAGACGTC 
TCTAAATACA 
AATATTGAAA 
TTGCGGCATT 
CTGAAGATCA 
TCCTTGAGAG 
TATGTGGCGC 
ACTATTCTCA 
GCATGACAGT 
ACTTACTTCT 
GGGATCATGT 
ACGAGCGTGA 
GCGAACTACT 
TTGCAGGACC 
GAGCCGGTGA 
CCCGTATCGT 
AGATCGCTGA 
CATATATACT 
TCCTTTTTGA 
CAGACCCCGT 
GCTGCTTGCA 
TACCAACTCT 
TT C TAG TGT A 
TCGCTCTGCT 
GGTTGGACTC 
CGTGCACACA 
AGCTATGAGA 
GCAGGGTCGG 
ATAGTCCTGT 
GGGGGCGGAG 
GCTGGCCTTT 
TTACCGCCTT 
CAGTGAGCGA 
GTATTTCACA 
AAGC CAGTAT 
CCAACACCCG 
GCTGTGACCG 
GCGAGGCAGC 
TGTTCATCCG 
AAGCGGGCCA 
GGATTTCTGT 
GTTACTGATG 
TGGATGCGGC 
GATGTAGGTG 
GTGCAGGGCG 
CATGTTGTTG 
ATCGGTGATT 
GACAGGAGCA 
GTGCGGCTGC 
TTCACAGTTC 
AGGTGCCGCC 
CGGGGAGGCA 
TCGCCGAGGC 



CCATAGTGAC 
GGCATAACCA 
GCATTGTTAG 
CCGCATTAAA 
CCTCGTGATA 
AGGTGGCACT 
TTCAAATATG 
AAGGAAGAGT 
TTGCCTTCCT 
GTTGGGTGCA 
TTTTCGCCCC 
GGTATTATCC 
GAATGACTTG 
AAGAGAATTA 
GACAACGATC 
AACTCGCCTT 
CACCACGATG 
TACTCTAGCT 
ACTTCTGCGC 
GCGTGGGTCT 
AGTTATCTAC 
GATAGGTGCC 
TTAGATTGAT 
TAATCTCATG 
AG AAAAG AT C 
AACAAAAAAA 
TTTTCCGAAG 
GCCGTAGTTA 
AATCCTGTTA 
AAGACGATAG 
GCCCAGCTTG 
AAGCGCCACG 
AACAGGAGAG 
CGGGTTTCGC 
CCTATGGAAA 
TGCTCACATG 
TGAGTGAGCT 
GGAAGCGGAA 
CCGCATATAT 
ACACTCCGCT 
CTGACGCGCC 
TCTCCGGGAG 
TGCGGTAAAG 
CGTCCAGCTC 
TGTTAAGGGC 
TCATGGGGGT 
ATGAACATGC 
GGGACCAGAG 
TTCCACAGGG 
CTGACTTCCG 
CTCAGGTCGC 
CAT T C TG CTA 
CGATCATGCG 
TGGAGATGGC 
TCCGCAAGAA 
GGCTTCCATT 
GACAAGGTAT 
GGC ATAAATC — 






6181 GCCGTGACGA TCAGCGGTCC AGTGATCGAA GTTAGGCTGG TAAGAGCCGC GAGCGATCCT
6241 TGAAGCTGTC CCTGATGGTC GTCATCTACC TGCCTGGACA GCATGGCCTG CAACGCGGGC
6301 ATCCCGATGC CGCCGGAAGC GAGAAGAATC ATAATGGGGA AGGCCATCCA GCCTCGCGTC
6361 GCGAACGCCA GCAAGACGTA GCCCAGCGCG TCGGCCGCCA TGCCGGCGAT AATGGCCTGC
6421 TTCTCGCCGA AACGTTTGGT GGCGGGACCA GTGACGAAGG CTTGAGCGAG GGCGTGCAAG
6481 ATTCCGAATA CCGCAAGCGA CAGGCCGATC ATCGTCGCGC TCCAGCGAAA GCGGTCCTCG
6541 CCGAAAATGA CCCAGAGCGC TGCCGGCACC TGTCCTACGA GTTGCATGAT AAAGAAGACA
6601 GTCATAAGTG CGGCGACGAT AGTCATGCCC CGCGCCCACC GGAAGGAGCT GACTGGGTTG
6661 AAGGCTCTCA AGGGCATCGG TCGATCGACG CTCTCCCTTA TGCGACTCCT GCATTAGGAA
6721 GCAGCCCAGT AGTAGGTTGA GGCCGTTGAG CACCGCCGCC GCAAGGAATG GTGCATGCAA
6781 GGAGATGGCG CCCAACAGTC CCCCGGCCAC GGGGCCTGCC ACCATACCCA CGCCGAAACA
6841 AGCGCTCATG AGCCCGAAGT GGCGAGCCCG ATCTTCCCCA TCGGTGATGT CGGCGATATA
6901 GGCGCCAGCA ACCGCACCTG TGGCGCCGGT GATGCCGGCC ACGATGCGTC CGGCGTAGAG
6961 G
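Each row of the listing above begins with the number of its first base, followed by groups of ten bases, so the number of the last base on a row is the row number plus the base count minus one. The following Python sketch, offered only as an illustration, parses one row and computes that end position, which can be compared with the stated vector length. The function names are hypothetical, and the parser assumes a row of the form "number, then A/C/G/T groups"; it tolerates stray spaces inside the row number, as found in this copy, but not other scanning damage.

```python
import re

# Hypothetical helper names; assumes a row is "<base number> <groups of A/C/G/T>".
_ROW = re.compile(r"^\s*([\d\s]*\d)\s+([ACGTacgt\s]+)$")


def parse_listing_line(line: str) -> tuple[int, str]:
    """Split one listing row into (first base number, bases without spaces)."""
    match = _ROW.match(line)
    if match is None:
        raise ValueError(f"not a recognizable listing row: {line!r}")
    start = int(re.sub(r"\s", "", match.group(1)))   # "63 01" -> 6301
    bases = re.sub(r"\s", "", match.group(2)).upper()
    return start, bases


def last_base_number(line: str) -> int:
    """Number of the final base on a listing row."""
    start, bases = parse_listing_line(line)
    return start + len(bases) - 1


final_row_end = last_base_number("6961 G")
full_row_end = last_base_number(
    "6901 GGCGCCAGCA ACCGCACCTG TGGCGCCGGT GATGCCGGCC ACGATGCGTC CGGCGTAGAG"
)
```

Applied to the final row of the pDEST24 listing, the end position agrees with the 6961 bp length given for that vector; the same check can be run on the last row of any other listing in this section.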
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Thioredoxin carboxy-fusion vector, T7 promoter 



[Figure: annotated double-stranded sequence of pDEST25 showing the T7 promoter, the attR1 and attR2 junctions, and the start of the thioredoxin coding region. A PshAI site is labeled. The base-level detail is not legible in this copy; the complete sequence is listed below.]

pDEST25 6652 bp



Location (Base Nos.)    Gene Encoded
844..720                attR1
953..1612               CmR
1732..1816              inactivated ccdA
1954..2259              ccdB
2300..2424              attR2
2432..2794              trx



1 CCGGAAGCGA GAAGAATCAT AATGGGGAAG GCCATCCAGC CTCGCGTCGC GAACGCCAGC
61 AAGACGTAGC CCAGCGCGTC GGCCGCCATG CCGGCGATAA TGGCCTGCTT CTCGCCGAAA
121 CGTTTGGTGG CGGGACCAGT GACGAAGGCT TGAGCGAGGG CGTGCAAGAT TCCGAATACC
181 GCAAGCGACA GGCCGATCAT CGTCGCGCTC CAGCGAAAGC GGTCCTCGCC GAAAATGACC
241 CAGAGCGCTG CCGGCACCTG TCCTACGAGT TGCATGATAA AGAAGACAGT CATAAGTGCG
301 GCGACGATAG TCATGCCCCG CGCCCACCGG AAGGAGCTGA CTGGGTTGAA GGCTCTCAAG
361 GGCATCGGTC GATCGACGCT CTCCCTTATG CGACTCCTGC ATTAGGAAGC AGCCCAGTAG
421 TAGGTTGAGG CCGTTGAGCA CCGCCGCCGC AAGGAATGGT GCATGCAAGG AGATGGCGCC
481 CAACAGTCCC CCGGCCACGG GGCCTGCCAC CATACCCACG CCGAAACAAG CGCTCATGAG
541 CCCGAAGTGG CGAGCCCGAT CTTCCCCATC GGTGATGTCG GCGATATAGG CGCCAGCAAC
601 CGCACCTGTG GCGCCGGTGA TGCCGGCCAC GATGCGTCCG GCGTAGAGGA TCGAGATCTC
661 GATCCCGCGA AATTAATACG ACTCACTATA GGGAGACCAC AACGGTTTCC CTCTAGATCA
721 CAAGTTTGTA CAAAAAAGCT GAACGAGAAA CGTAAAATGA TATAAATATC AATATATTAA
781 ATTAGATTTT GCATAAAAAA CAGACTACAT AATACTGTAA AACACAACAT ATCCAGTCAC
841 TATGGCGGCC GCATTAGGCA CCCCAGGCTT TACACTTTAT GCTTCCGGCT CGTATAATGT
901 GTGGATTTTG AGTTAGGATC CGGCGAGATT TTCAGGAGCT AAGGAAGCTA AAATGGAGAA
961 AAAAATCACT GGATATACCA CCGTTGATAT ATCCCAATGG CATCGTAAAG AACATTTTGA
1021 GGCATTTCAG TCAGTTGCTC AATGTACCTA TAACCAGACC GTTCAGCTGG ATATTACGGC
1081 CTTTTTAAAG ACCGTAAAGA AAAATAAGCA CAAGTTTTAT CCGGCCTTTA TTCACATTCT
1141 TGCCCGCCTG ATGAATGCTC ATCCGGAATT CCGTATGGCA ATGAAAGACG GTGAGCTGGT
1201 GATATGGGAT AGTGTTCACC CTTGTTACAC CGTTTTCCAT GAGCAAACTG AAACGTTTTC
1261 ATCGCTCTGG AGTGAATACC ACGACGATTT CCGGCAGTTT CTACACATAT ATTCGCAAGA
1321 TGTGGCGTGT TACGGTGAAA ACCTGGCCTA TTTCCCTAAA GGGTTTATTG AGAATATGTT
1381 TTTCGTCTCA GCCAATCCCT GGGTGAGTTT CACCAGTTTT GATTTAAACG TGGCCAATAT
1441 GGACAACTTC TTCGCCCCCG TTTTCACCAT GGGCAAATAT TATACGCAAG GCGACAAGGT
1501 GCTGATGCCG CTGGCGATTC AGGTTCATCA TGCCGTCTGT GATGGCTTCC ATGTCGGCAG
1561 AATGCTTAAT GAATTACAAC AGTACTGCGA TGAGTGGCAG GGCGGGGCGT AAACGCGTGG
1621 ATCCGGCTTA CTAAAAGCCA GATAACAGTA TGCGTATTTG CGCGCTGATT TTTGCGGTAT
1681 AAGAATATAT ACTGATATGT ATACCCGAAG TATGTCAAAA AGAGGTGTGC TATGAAGCAG
1741 CGTATTACAG TGACAGTTGA CAGCGACAGC TATCAGTTGC TCAAGGCATA TATGATGTCA
1801 ATATCTCCGG TCTGGTAAGC ACAACCATGC AGAATGAAGC CCGTCGTCTG CGTGCCGAAC
1861 GCTGGAAAGC GGAAAATCAG GAAGGGATGG CTGAGGTCGC CCGGTTTATT GAAATGAACG
1921 GCTCTTTTGC TGACGAGAAC AGGGACTGGT GAAATGCAGT TTAAGGTTTA CACCTATAAA
1981 AGAGAGAGCC GTTATCGTCT GTTTGTGGAT GTACAGAGTG ATATTATTGA CACGCCCGGG
2041 CGACGGATGG TGATCCCCCT GGCCAGTGCA CGTCTGCTGT CAGATAAAGT CTCCCGTGAA
2101 CTTTACCCGG TGGTGCATAT CGGGGATGAA AGCTGGCGCA TGATGACCAC CGATATGGCC
2161 AGTGTGCCGG TCTCCGTTAT CGGGGAAGAA GTGGCTGATC TCAGCCACCG CGAAAATGAC
2221 ATCAAAAACG CCATTAACCT GATGTTCTGG GGAATATAAA TGTCAGGCTC CCTTATACAC
2281 AGCCAGTCTG CAGGTCGACC ATAGTGACTG GATATGTTGT GTTTTACAGT ATTATGTAGT
2341 CTGTTTTTTA TGCAAAATCT AATTTAATAT ATTGATATTT ATATCATTTT ACGTTTCTCG
2401 TTCAGCTTTC TTGTACAAAG TGGTGATTAT GAGCGATAAA ATTATTCACC TGACTGACGA
2461 CAGTTTTGAC ACGGATGTAC TCAAAGCGGA CGGGGCGATC CTCGTCGATT TCTGGGCAGA
2521 GTGGTGCGGT CCGTGCAAAA TGATCGCCCC GATTCTGGAT GAAATCGCTG ACGAATATCA
2581 GGGCAAACTG ACCGTTGCAA AACTGAACAT CGATCAAAAC CCTGGCACTG CGCCGAAATA
2641 TGGCATCCGT GGTATCCCGA CTCTGCTGCT GTTCAAAAAC GGTGAAGTGG CGGCAACCAA
2701 AGTGGGTGCA CTGTCTAAAG GTCAGTTGAA AGAGTTCCTC GACGCTAACC TGGCCGGTTC
2761 TGGTTCTGGT GATGACGATG ACAAGGTACC CGGGGATCGA TCCGGCTGCT AACAAAGCCC
2821 GAAAGGAAGC TGAGTTGGCT GCTGCCACCG CTGAGCAATA ACTAGCATAA CCCCTTGGGG
2881 CCTCTAAACG GGTCTTGAGG GGTTTTTTGC TGAAAGGAGG AACTATATCC GGATATCCAC
2941 AGGACGGGTG TGGTCGCCAT GATCGCGTAG TCGATAGTGG CTCCAAGTAG CGAAGCGAGC
3001 AGGACTGGGC GGCGGCCAAA GCGGTCGGAC AGTGCTCCGA GAACGGGTGC GCATAGAAAT
3061 TGCATCAACG CATATAGCGC TAGCAGCACG CCATAGTGAC TGGCGATGCT GTCGGAATGG
3121 ACGATATCCC GCAAGAGGCC CGGCAGTACC GGCATAACCA AGCCTATGCC TACAGCATCC
3181 AGGGTGACGG TGCCGAGGAT GACGATGAGC GCATTGTTAG ATTTCATACA CGGTGCCTGA
3241 CTGCGTTAGC AATTTAACTG TGATAAACTA CCGCATTAAA GCTTATCGAT GATAAGCTGT
3301 CAAACATGAG AATTCTTGAA GACGAAAGGG CCTCGTGATA CGCCTATTTT TATAGGTTAA
3361 TGTCATGATA ATAATGGTTT CTTAGACGTC AGGTGGCACT TTTCGGGGAA ATGTGCGCGG
3421 AACCCCTATT TGTTTATTTT TCTAAATACA TTCAAATATG TATCCGCTCA TGAGACAATA
3481 ACCCTGATAA ATGCTTCAAT AATATTGAAA AAGGAAGAGT ATGAGTATTC AACATTTCCG
3541 TGTCGCCCTT ATTCCCTTTT TTGCGGCATT TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC
3601 GCTGGTGAAA GTAAAAGATG CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT
3661 GGATCTCAAC AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT
3721 GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG CCGGGCAAGA
3781 GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG GTTGAGTACT CACCAGTCAC
3841 AGAAAAGCAT CTTACGGATG GCATGACAGT AAGAGAATTA TGCAGTGCTG CCATAACCAT
3901 GAGTGATAAC ACTGCGGCCA ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC
3961 CGCTTTTTTG CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT
4021 GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA TGGCAACAAC
4081 GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT TCCCGGCAAC AATTAATAGA
4141 CTGGATGGAG GCGGATAAAG TTGCAGGACC ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG
4201 GTTTATTGCT GATAAATCTG GAGCCGGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT
4261 GGGGCCAGAT GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC
4321 TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA AGCATTGGTA
4381 ACTGTCAGAC CAAGTTTACT CATATATACT TTAGATTGAT TTAAAACTTC ATTTTTAATT
4441 TAAAAGGATC TAGGTGAAGA TCCTTTTTGA TAATCTCATG ACCAAAATCC CTTAACGTGA
4501 GTTTTCGTTC CACTGAGCGT CAGACCCCGT AGAAAAGATC AAAGGATCTT CTTGAGATCC
4561 TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT
4621 TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC
4681 GCAGATACCA AATACTGTCC TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC
4741 TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG
4801 CGATAAGTCG TGTCTTACCG GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG
4861 GTCGGGCTGA ACGGGGGGTT CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTACACCGA
4921 ACTGAGATAC CTACAGCGTG AGCTATGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC
4981 GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG
5041 GGGAAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG
5101 ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT
5161 TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG TTCTTTCCTG CGTTATCCCC
5221 TGATTCTGTG GATAACCGTA TTACCGCCTT TGAGTGAGCT GATACCGCTC GCCGCAGCCG
5281 AACGACCGAG CGCAGCGAGT CAGTGAGCGA GGAAGCGGAA GAGCGCCTGA TGCGGTATTT
5341 TCTCCTTACG CATCTGTGCG GTATTTCACA CCGCATATAT GGTGCACTCT CAGTACAATC
5401 TGCTCTGATG CCGCATAGTT AAGCCAGTAT ACACTCCGCT ATCGCTACGT GACTGGGTCA
5461 TGGCTGCGCC CCGACACCCG CCAACACCCG CTGACGCGCC CTGACGGGCT TGTCTGCTCC
5521 CGGCATCCGC TTACAGACAA GCTGTGACCG TCTCCGGGAG CTGCATGTGT CAGAGGTTTT
5581 CACCGTCATC ACCGAAACGC GCGAGGCAGC TGCGGTAAAG CTCATCAGCG TGGTCGTGAA
5641 GCGATTCACA GATGTCTGCC TGTTCATCCG CGTCCAGCTC GTTGAGTTTC TCCAGAAGCG
5701 TTAATGTCTG GCTTCTGATA AAGCGGGCCA TGTTAAGGGC GGTTTTTTCC TGTTTGGTCA
5761 CTGATGCCTC CGTGTAAGGG GGATTTCTGT TCATGGGGGT AATGATACCG ATGAAACGAG
5821 AGAGGATGCT CACGATACGG GTTACTGATG ATGAACATGC CCGGTTACTG GAACGTTGTG
5881 AGGGTAAACA ACTGGCGGTA TGGATGCGGC GGGACCAGAG AAAAATCACT CAGGGTCAAT
5941 GCCAGCGCTT CGTTAATACA GATGTAGGTG TTCCACAGGG TAGCCAGCAG CATCCTGCGA
6001 TGCAGATCCG GAACATAATG GTGCAGGGCG CTGACTTCCG CGTTTCCAGA CTTTACGAAA
6061 CACGGAAACC GAAGACCATT CATGTTGTTG CTCAGGTCGC AGACGTTTTG CAGCAGCAGT
6121 CGCTTCACGT TCGCTCGCGT ATCGGTGATT CATTCTGCTA ACCAGTAAGG CAACCCCGCC
6181 AGCCTAGCCG GGTCCTCAAC GACAGGAGCA CGATCATGCG CACCCGTGGC CAGGACCCAA
6241 CGCTGCCCGA GATGCGCCGC GTGCGGCTGC TGGAGATGGC GGACGCGATG GATATGTTCT
6301 GCCAAGGGTT GGTTTGCGCA TTCACAGTTC TCCGCAAGAA TTGATTGGCT CCAATTCTTG
6361 GAGTGGTGAA TCCGTTAGCG AGGTGCCGCC GGCTTCCATT CAGGTCGAGG TGGCCCGGCT
6421 CCATGCACCG CGACGCAACG CGGGGAGGCA GACAAGGTAT AGGGCGGCGC CTACAATCCA
6481 TGCCAACCCG TTCCATGTGC TCGCCGAGGC GGCATAAATC GCCGTGACGA TCAGCGGTCC
6541 AGTGATCGAA GTTAGGCTGG TAAGAGCCGC GAGCGATCCT TGAAGCTGTC CCTGATGGTC
6601 GTCATCTACC TGCCTGGACA GCATGGCCTG CAACGCGGGC ATCCCGATGC CG

pDEST26 His6 Amino Fusion in pCMV Sport-neo Vector



[Figure: annotated double-stranded sequence of pDEST26 running from the CMV promoter region through the His6 codons into attR1 (rows numbered 600 to 855). The base-level detail is not legible in this copy; the complete sequence is listed below.]



[Figure: plasmid map of pDEST26 with the following restriction sites and map positions labeled.]

SnaB I 292
Sst I 518
Bbs I 581
Sst II 623
Xba I 662
Not I 796
EcoR I 1117
PflM I 1343
Bgl II 1564
Bst1107 I 1650
Srf I 1985
Ava I 1986
Sma I 1986
Sal I 2245
Nhe I 2475
Bcl I 2751
Mun I 2844
Hpa I 2855
Dra III 3346
Kpn I 3684
SexA I 3801
Sfi I 3980
BseR I 4015
Stu I 4031
Hind III 4050
Afl II 4082
Nar I 4227
Tth111 I 4343
Rsr II 4743
Nsp V 4909
Xmn I 5600
Eam1105 I 6199
BspLU11 I 7092
Afl III 7092

pDEST26 7481 bp



Location (Base Nos.)    Gene Encoded
492..509                his6
619..519                attR1
752..1411               CmR
1531..1615              inactivated ccdA
1753..2058              ccdB
2099..2223              attR2
2409..2771              SV40 polyA
2966..3421              f1 intergenic region
3485..3903              SV40 promoter
3948..4742              neo
4806..4854              polyA
5265..6125              Apr
6274..6913              ori
7344..385               CMV promoter



1 GTAAACTGCC CACTTGGCAG TACATCAAGT GTATCATATG CCAAGTACGC CCCCTATTGA
61 CGTCAATGAC GGTAAATGGC CCGCCTGGCA TTATGCCCAG TACATGACCT TATGGGACTT
121 TCCTACTTGG CAGTACATCT ACGTATTAGT CATCGCTATT ACCATGGTGA TGCGGTTTTG
181 GCAGTACATC AATGGGCGTG GATAGCGGTT TGACTCACGG GGATTTCCAA GTCTCCACCC
241 CATTGACGTC AATGGGAGTT TGTTTTGGCA CCAAAATCAA CGGGACTTTC CAAAATGTCG
301 TAACAACTCC GCCCCATTGA CGCAAATGGG CGGTAGGCGT GTACGGTGGG AGGTCTATAT
361 AAGCAGAGCT CGTTTAGTGA ACCGTCAGAT CGCCTGGAGA CGCCATCCAC GCTGTTTTGA
421 CCTCCATAGA AGACACCGGG ACCGATCCAG CCTCCGGACT CTAGCCTAGG CCGCGGACCA
481 TGGCGTACTA CCATCACCAT CACCATCACT CTAGATCAAC AAGTTTGTAC AAAAAAGCTG
541 AACGAGAAAC GTAAAATGAT ATAAATATCA ATATATTAAA TTAGATTTTG CATAAAAAAC
601 AGACTACATA ATACTGTAAA ACACAACATA TCCAGTCACT ATGGCGGCCG CATTAGGCAC
661 CCCAGGCTTT ACACTTTATG CTTCCGGCTC GTATAATGTG TGGATTTTGA GTTAGGATCC
721 GGCGAGATTT TCAGGAGCTA AGGAAGCTAA AATGGAGAAA AAAATCACTG GATATACCAC
781 CGTTGATATA TCCCAATGGC ATCGTAAAGA ACATTTTGAG GCATTTCAGT CAGTTGCTCA
841 ATGTACCTAT AACCAGACCG TTCAGCTGGA TATTACGGCC TTTTTAAAGA CCGTAAAGAA
901 AAATAAGCAC AAGTTTTATC CGGCCTTTAT TCACATTCTT GCCCGCCTGA TGAATGCTCA
961 TCCGGAATTC CGTATGGCAA TGAAAGACGG TGAGCTGGTG ATATGGGATA GTGTTCACCC
1021 TTGTTACACC GTTTTCCATG AGCAAACTGA AACGTTTTCA TCGCTCTGGA GTGAATACCA
1081 CGACGATTTC CGGCAGTTTC TACACATATA TTCGCAAGAT GTGGCGTGTT ACGGTGAAAA
1141 CCTGGCCTAT TTCCCTAAAG GGTTTATTGA GAATATGTTT TTCGTCTCAG CCAATCCCTG
1201 GGTGAGTTTC ACCAGTTTTG ATTTAAACGT GGCCAATATG GACAACTTCT TCGCCCCCGT
1261 TTTCACCATG GGCAAATATT ATACGCAAGG CGACAAGGTG CTGATGCCGC TGGCGATTCA
1321 GGTTCATCAT GCCGTCTGTG ATGGCTTCCA TGTCGGCAGA ATGCTTAATG AATTACAACA
1381 GTACTGCGAT GAGTGGCAGG GCGGGGCGTA AAGATCTGGA TCCGGCTTAC TAAAAGCCAG
1441 ATAACAGTAT GCGTATTTGC GCGCTGATTT TTGCGGTATA AGAATATATA CTGATATGTA
1501 TACCCGAAGT ATGTCAAAAA GAGGTGTGCT ATGAAGCAGC GTATTACAGT GACAGTTGAC
1561 AGCGACAGCT ATCAGTTGCT CAAGGCATAT ATGATGTCAA TATCTCCGGT CTGGTAAGCA
1621 CAACCATGCA GAATGAAGCC CGTCGTCTGC GTGCCGAACG CTGGAAAGCG GAAAATCAGG
1681 AAGGGATGGC TGAGGTCGCC CGGTTTATTG AAATGAACGG CTCTTTTGCT GACGAGAACA
1741 GGGACTGGTG AAATGCAGTT TAAGGTTTAC ACCTATAAAA GAGAGAGCCG TTATCGTCTG
1801 TTTGTGGATG TACAGAGTGA TATTATTGAC ACGCCCGGGC GACGGATGGT GATCCCCCTG
1861 GCCAGTGCAC GTCTGCTGTC AGATAAAGTC TCCCGTGAAC TTTACCCGGT GGTGCATATC
1921 GGGGATGAAA GCTGGCGCAT GATGACCACC GATATGGCCA GTGTGCCGGT CTCCGTTATC
1981 GGGGAAGAAG TGGCTGATCT CAGCCACCGC GAAAATGACA TCAAAAACGC CATTAACCTG
2041 ATGTTCTGGG GAATATAAAT GTCAGGCTCC CTTATACACA GCCAGTCTGC AGGTCGACCA
2101 TAGTGACTGG ATATGTTGTG TTTTACAGTA TTATGTAGTC TGTTTTTTAT GCAAAATCTA
2161 ATTTAATATA TTGATATTTA TATCATTTTA CGTTTCTCGT TCAGCTTTCT TGTACAAAGT
2221 GGTTGATCGC GTGCATGCGA CGTCATAGCT CTCTCCCTAT AGTGAGTCGT ATTATAAGCT
2281 AGGCACTGGC CGTCGTTTTA CAACGTCGTG ACTGGGAAAA CTGCTAGCTT GGGATCTTTG
2341 TGAAGGAACC TTACTTCTGT GGTGTGACAT AATTGGACAA ACTACCTACA GAGATTTAAA
2401 GCTCTAAGGT AAATATAAAA TTTTTAAGTG TATAATGTGT TAAACTAGCT GCATATGCTT
2461 GCTGCTTGAG AGTTTTGCTT ACTGAGTATG ATTTATGAAA ATATTATACA CAGGAGCTAG
2521 TGATTCTAAT TGTTTGTGTA TTTTAGATTC ACAGTCCCAA GGCTCATTTC AGGCCCCTCA
2581 GTCCTCACAG TCTGTTCATG ATCATAATCA GCCATACCAC ATTTGTAGAG GTTTTACTTG
2641 CTTTAAAAAA CCTCCCACAC CTCCCCCTGA ACCTGAAACA TAAAATGAAT GCAATTGTTG
2701 TTGTTAACTT GTTTATTGCA GCTTATAATG GTTACAAATA AAGCAATAGC ATCACAAATT
2761 TCACAAATAA AGCATTTTTT TCACTGCATT CTAGTTGTGG TTTGTCCAAA CTCATCAATG
2821 TATCTTATCA TGTCTGGATC GATCCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC
2881 GGTTTGCGTA TTGGCTGGCG TAATAGCGAA GAGGCCCGCA CCGATCGCCC TTCCCAACAG
2941 TTGCGCAGCC TGAATGGCGA ATGGGACGCG CCCTGTAGCG GCGCATTAAG CGCGGCGGGT
3001 GTGGTGGTTA CGCGCAGCGT GACCGCTACA CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC
3061 GCTTTCTTCC CTTCCTTTCT CGCCACGTTC GCCGGCTTTC CCCGTCAAGC TCTAAATCGG
3121 GGGCTCCCTT TAGGGTTCCG ATTTAGTGCT TTACGGCACC TCGACCCCAA AAAACTTGAT
3181 TAGGGTGATG GTTCACGTAG TGGGCCATCG CCCTGATAGA CGGTTTTTCG CCCTTTGACG
3241 TTGGAGTCCA CGTTCTTTAA TAGTGGACTC TTGTTCCAAA CTGGAACAAC ACTCAACCCT
3301 ATCTCGGTCT ATTCTTTTGA TTTATAAGGG ATTTTGCCGA TTTCGGCCTA TTGGTTAAAA
3361 AATGAGCTGA TTTAACAAAT ATTTAACGCG AATTTTAACA AAATATTAAC GTTTACAATT
3421 TCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC GCATACGCGG
3481 ATCTGCGCAG CACCATGGCC TGAAATAACC TCTGAAAGAG GAACTTGGTT AGGTACCTTC
3541 TGAGGCGGAA AGAACCAGCT GTGGAATGTG TGTCAGTTAG GGTGTGGAAA GTCCCCAGGC
3601 TCCCCAGCAG GCAGAAGTAT GCAAAGCATG CATCTCAATT AGTCAGCAAC CAGGTGTGGA
3661 AAGTCCCCAG GCTCCCCAGC AGGCAGAAGT ATGCAAAGCA TGCATCTCAA TTAGTCAGCA
3721 ACCATAGTCC CGCCCCTAAC TCCGCCCATC CCGCCCCTAA CTCCGCCCAG TTCCGCCCAT
3781 TCTCCGCCCC ATGGCTGACT AATTTTTTTT ATTTATGCAG AGGCCGAGGC CGCCTCGGCC
3841 TCTGAGCTAT TCCAGAAGTA GTGAGGAGGC TTTTTTGGAG GCCTAGGCTT TTGCAAAAAG
3901 CTTGATTCTT CTGACACAAC AGTCTCGAAC TTAAGGCTAG AGCCACCATG ATTGAACAAG
3961 ATGGATTGCA CGCAGGTTCT CCGGCCGCTT GGGTGGAGAG GCTATTCGGC TATGACTGGG
4021 CACAACAGAC AATCGGCTGC TCTGATGCCG CCGTGTTCCG GCTGTCAGCG CAGGGGCGCC
4081 CGGTTCTTTT TGTCAAGACC GACCTGTCCG GTGCCCTGAA TGAACTGCAG GACGAGGCAG
4141 CGCGGCTATC GTGGCTGGCC ACGACGGGCG TTCCTTGCGC AGCTGTGCTC GACGTTGTCA
4201 CTGAAGCGGG AAGGGACTGG CTGCTATTGG GCGAAGTGCC GGGGCAGGAT CTCCTGTCAT
4261 CTCACCTTGC TCCTGCCGAG AAAGTATCCA TCATGGCTGA TGCAATGCGG CGGCTGCATA
4321 CGCTTGATCC GGCTACCTGC CCATTCGACC ACCAAGCGAA ACATCGCATC GAGCGAGCAC
4381 GTACTCGGAT GGAAGCCGGT CTTGTCGATC AGGATGATCT GGACGAAGAG CATCAGGGGC
4441 TCGCGCCAGC CGAACTGTTC GCCAGGCTCA AGGCGCGCAT GCCCGACGGC GAGGATCTCG
4501 TCGTGACCCA TGGCGATGCC TGCTTGCCGA ATATCATGGT GGAAAATGGC CGCTTTTCTG
4561 GATTCATCGA CTGTGGCCGG CTGGGTGTGG CGGACCGCTA TCAGGACATA GCGTTGGCTA
4621 CCCGTGATAT TGCTGAAGAG CTTGGCGGCG AATGGGCTGA CCGCTTCCTC GTGCTTTACG
4681 GTATCGCCGC TCCCGATTCG CAGCGCATCG CCTTCTATCG CCTTCTTGAC GAGTTCTTCT
4741 GAGCGGGACT CTGGGGTTCG AAATGACCGA CCAAGCGACG CCCAACCTGC CATCACGATG
4801 GCCGCAATAA AATATCTTTA TTTTCATTAC ATCTGTGTGT TGGTTTTTTG TGTGAATCGA
4861 TAGCGATAAG GATCCGCGTA TGGTGCACTC TCAGTACAAT CTGCTCTGAT GCCGCATAGT
4921 TAAGCCAGCC CCGACACCCG CCAACACCCG CTGACGCGCC CTGACGGGCT TGTCTGCTCC
4981 CGGCATCCGC TTACAGACAA GCTGTGACCG TCTCCGGGAG CTGCATGTGT CAGAGGTTTT
5041 CACCGTCATC ACCGAAACGC GCGAGACGAA AGGGCCTCGT GATACGCCTA TTTTTATAGG
5101 TTAATGTCAT GATAATAATG GTTTCTTAGA CGTCAGGTGG CACTTTTCGG GGAAATGTGC
5161 GCGGAACCCC TATTTGTTTA TTTTTCTAAA TACATTCAAA TATGTATCCG CTCATGAGAC
5221 AATAACCCTG ATAAATGCTT CAATAATATT GAAAAAGGAA GAGTATGAGT ATTCAACATT
5281 TCCGTGTCGC CCTTATTCCC TTTTTTGCGG CATTTTGCCT TCCTGTTTTT GCTCACCCAG
5341 AAACGCTGGT GAAAGTAAAA GATGCTGAAG ATCAGTTGGG TGCACGAGTG GGTTACATCG
5401 AACTGGATCT CAACAGCGGT AAGATCCTTG AGAGTTTTCG CCCCGAAGAA CGTTTTCCAA
5461 TGATGAGCAC TTTTAAAGTT CTGCTATGTG GCGCGGTATT ATCCCGTATT GACGCCGGGC
5521 AAGAGCAACT CGGTCGCCGC ATACACTATT CTCAGAATGA CTTGGTTGAG TACTCACCAG
5581 TCACAGAAAA GCATCTTACG GATGGCATGA CAGTAAGAGA ATTATGCAGT GCTGCCATAA
5641 CCATGAGTGA TAACACTGCG GCCAACTTAC TTCTGACAAC GATCGGAGGA CCGAAGGAGC
5701 TAACCGCTTT TTTGCACAAC ATGGGGGATC ATGTAACTCG CCTTGATCGT TGGGAACCGG
5761 AGCTGAATGA AGCCATACCA AACGACGAGC GTGACACCAC GATGCCTGTA GCAATGGCAA
5821 CAACGTTGCG CAAACTATTA ACTGGCGAAC TACTTACTCT AGCTTCCCGG CAACAATTAA
5881 TAGACTGGAT GGAGGCGGAT AAAGTTGCAG GACCACTTCT GCGCTCGGCC CTTCCGGCTG
5941 GCTGGTTTAT TGCTGATAAA TCTGGAGCCG GTGAGCGTGG GTCTCGCGGT ATCATTGCAG
6001 CACTGGGGCC AGATGGTAAG CCCTCCCGTA TCGTAGTTAT CTACACGACG GGGAGTCAGG
6061 CAACTATGGA TGAACGAAAT AGACAGATCG CTGAGATAGG TGCCTCACTG ATTAAGCATT
6121 GGTAACTGTC AGACCAAGTT TACTCATATA TACTTTAGAT TGATTTAAAA CTTCATTTTT
6181 AATTTAAAAG GATCTAGGTG AAGATCCTTT TTGATAATCT CATGACCAAA ATCCCTTAAC
6241 GTGAGTTTTC GTTCCACTGA GCGTCAGACC CCGTAGAAAA GATCAAAGGA TCTTCTTGAG
6301 ATCCTTTTTT TCTGCGCGTA ATCTGCTGCT TGCAAACAAA AAAACCACCG CTACCAGCGG
6361 TGGTTTGTTT GCCGGATCAA GAGCTACCAA CTCTTTTTCC GAAGGTAACT GGCTTCAGCA
6421 GAGCGCAGAT ACCAAATACT GTCCTTCTAG TGTAGCCGTA GTTAGGCCAC CACTTCAAGA
6481 ACTCTGTAGC ACCGCCTACA TACCTCGCTC TGCTAATCCT GTTACCAGTG GCTGCTGCCA
6541 GTGGCGATAA GTCGTGTCTT ACCGGGTTGG ACTCAAGACG ATAGTTACCG GATAAGGCGC
6601 AGCGGTCGGG CTGAACGGGG GGTTCGTGCA CACAGCCCAG CTTGGAGCGA ACGACCTACA
6661 CCGAACTGAG ATACCTACAG CGTGAGCATT GAGAAAGCGC CACGCTTCCC GAAGGGAGAA
6721 AGGCGGACAG GTATCCGGTA AGCGGCAGGG TCGGAACAGG AGAGCGCACG AGGGAGCTTC
6781 CAGGGGGAAA CGCCTGGTAT CTTTATAGTC CTGTCGGGTT TCGCCACCTC TGACTTGAGC
6841 GTCGATTTTT GTGATGCTCG TCAGGGGGGC GGAGCCTATG GAAAAACGCC AGCAACGCGG
6901 CCTTTTTACG GTTCCTGGCC TTTTGCTGGC CTTTTGCTCA CATGTTCTTT CCTGCGTTAT
6961 CCCCTGATTC TGTGGATAAC CGTATTACCG CCTTTGAGTG AGCTGATACC GCTCGCCGCA
7021 GCCGAACGAC CGAGCGCAGC GAGTCAGTGA GCGAGGAAGC GGAAGAGCGC CCAATACGCA
7081 AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG AGCTTGCAAT TCGCGCGTTT
7141 TTCAATATTA TTGAAGCATT TATCAGGGTT ATTGTCTCAT GAGCGGATAC ATATTTGAAT
7201 GTATTTAGAA AAATAAACAA ATAGGGGTTC CGCGCACATT TCCCCGAAAA GTGCCACCTG
7261 ACGTCTAAGA AACCATTATT ATCATGACAT TAACCTATAA AAATAGGCGT AGTACGAGGC
7321 CCTTTCACTC ATTAGATGCA TGTCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA
7381 CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA
7441 ATAGGGACTT TCCATTGACG TCAATGGGTG GAGTATTTAC G
pDEST27: GST Amino Fusion in pCMV Sport-neo Vector

[Figure: double-stranded sequence of the pDEST27 cloning region with the encoded amino acids, from the end of the CMV promoter (map position 600) through the start and end of the GST coding sequence to the beginning of attR1 (map position 1416 onward). The sequence is not legible in this copy.]

Restriction sites labeled on the plasmid map (enzyme, map position):

SnaB I 292
Sst I 518
Bbs I 581
Sst II 623
EcoN I 638
Bsg I 897
Swa I 1056
Xba I 1304
Not I 1438
EcoR I 1759
PflM I 1985
Bgl II 2206
Bst1107 I 2292
Srf I 2627
Ava I 2628
Sma I 2628
Sal I 2887
Nhe I 3117
Man I 3466
Hpa I 3497
Kpn I 4326
SexA I 4443
Sfi I 4622
BseR I 4657
Stu I 4673
Hind III 4692
Afl II 4724
Nar I 4869
Tth111 I 4985
Rsr II 5385
Eam1105 I 6841
pDEST27 8123 bp (rotated to position 7800)

Location (Base Nos.)    Gene Encoded
130..793                GST
803..927                attR1
1036..1695              CmR
1815..1899              inactivated ccdA
2037..2342              ccdB
2383..2507              attR2
2693..3055              SV40 polyA
3250..3705              f1 intergenic region
3769..4187              SV40 promoter
4232..5026              neo
5090..5138              polyA
5549..6409              Apr
6558..7197              ori
7628..27                CMV promoter
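
The ranges in the table above are 1-based and inclusive, and the plasmid is circular, so the last entry (CMV promoter, 7628..27) has a start greater than its end. The following Python sketch shows one way to compute feature lengths and pull out such a range. It assumes that a start greater than the end means the feature runs through the origin (from base 8123 on to base 1), which the table itself does not state. The function names are illustrative and are not part of the original listing.

```python
def feature_length(start: int, end: int, plasmid_len: int) -> int:
    """Length of a 1-based, inclusive range on a circular molecule."""
    if start <= end:
        return end - start + 1
    # Assumption: start > end means the range continues through the origin.
    return (plasmid_len - start + 1) + end


def extract_feature(seq: str, start: int, end: int) -> str:
    """Return bases start..end (1-based, inclusive) from a circular sequence."""
    if start <= end:
        return seq[start - 1:end]
    # Wrap-around: tail of the sequence followed by its head.
    return seq[start - 1:] + seq[:end]


if __name__ == "__main__":
    PDEST27_LEN = 8123  # size stated in the table header
    print("GST length:", feature_length(130, 793, PDEST27_LEN))
    print("CMV promoter length:", feature_length(7628, 27, PDEST27_LEN))
    toy = "ACGTACGTAC"  # toy 10-base circle, not plasmid data
    print("toy 9..2:", extract_feature(toy, 9, 2))
```

Under this reading the CMV promoter entry covers 523 bases (496 up to the last base plus 27 from the start of the listing).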



1 ATAAGCAGAG CTCGTTTAGT GAACCGTCAG ATCGCCTGGA GACGCCATCC ACGCTGTTTT
61 GACCTCCATA GAAGACACCG GGACCGATCC AGCCTCCGGA CTCTAGCCTA GGCCGCGGAC
121 CATGGCCCCT ATACTAGGTT ATTGGAAAAT TAAGGGCCTT GTGCAACCCA CTCGACTTCT
181 TTTGGAATAT CTTGAAGAAA AATATGAAGA GCATTTGTAT GAGCGCGATG AAGGTGATAA
241 ATGGCGAAAC AAAAAGTTTG AATTGGGTTT GGAGTTTCCC AATCTTCCTT ATTATATTGA
301 TGGTGATGTT AAATTAACAC AGTCTATGGC CATCATACGT TATATAGCTG ACAAGCACAA
361 CATGTTGGGT GGTTGTCCAA AAGAGCGTGC AGAGATTTCA ATGCTTGAAG GAGCGGTTTT
421 GGATATTAGA TACGGTGTTT CGAGAATTGC ATATAGTAAA GACTTTGAAA CTCTCAAAGT
481 TGATTTTCTT AGCAAGCTAC CTGAAATGCT GAAAATGTTC GAAGATCGTT TATGTCATAA
541 AACATATTTA AATGGTGATC ATGTAACCCA TCCTGACTTC ATGTTGTATG ACGCTCTTGA
601 TGTTGTTTTA TACATGGACC CAATGTGCCT GGATGCGTTC CCAAAATTAG TTTGTTTTAA
661 AAAACGTATT GAAGCTATCC CACAAATTGA TAAGTACTTG AAATCCAGCA AGTATATAGC
721 ATGGCCTTTG CAGGGCTGGC AAGCCACGTT TGGTGGTGGC GACCATCCTC CAAAATCGGA
781 TCTGGTTCCG CGTTCTAGAT CAACAAGTTT GTACAAAAAA GCTGAACGAG AAACGTAAAA
841 TGATATAAAT ATCAATATAT TAAATTAGAT TTTGCATAAA AAACAGACTA CATAATACTG
901 TAAAACACAA CATATCCAGT CACTATGGCG GCCGCATTAG GCACCCCAGG CTTTACACTT
961 TATGCTTCCG GCTCGTATAA TGTGTGGATT TTGAGTTAGG ATCCGGCGAG ATTTTCAGGA
1021 GCTAAGGAAG CTAAAATGGA GAAAAAAATC ACTGGATATA CCACCGTTGA TATATCCCAA
1081 TGGCATCGTA AAGAACATTT TGAGGCATTT CAGTCAGTTG CTCAATGTAC CTATAACCAG
1141 ACCGTTCAGC TGGATATTAC GGCCTTTTTA AAGACCGTAA AGAAAAATAA GCACAAGTTT
1201 TATCCGGCCT TTATTCACAT TCTTGCCCGC CTGATGAATG CTCATCCGGA ATTCCGTATG
1261 GCAATGAAAG ACGGTGAGCT GGTGATATGG GATAGTGTTC ACCCTTGTTA CACCGTTTTC
1321 CATGAGCAAA CTGAAACGTT TTCATCGCTC TGGAGTGAAT ACCACGACGA TTTCCGGCAG
1381 TTTCTACACA TATATTCGCA AGATGTGGCG TGTTACGGTG AAAACCTGGC CTATTTCCCT
1441 AAAGGGTTTA TTGAGAATAT GTTTTTCGTC TCAGCCAATC CCTGGGTGAG TTTCACCAGT
1501 TTTGATTTAA ACGTGGCCAA TATGGACAAC TTCTTCGCCC CCGTTTTCAC CATGGGCAAA
1561 TATTATACGC AAGGCGACAA GGTGCTGATG CCGCTGGCGA TTCAGGTTCA TCATGCCGTC
1621 TGTGATGGCT TCCATGTCGG CAGAATGCTT AATGAATTAC AACAGTACTG CGATGAGTGG
1681 CAGGGCGGGG CGTAAAGATC TGGATCCGGC TTACTAAAAG CCAGATAACA GTATGCGTAT
1741 TTGCGCGCTG ATTTTTGCGG TATAAGAATA TATACTGATA TGTATACCCG AAGTATGTCA
1801 AAAAGAGGTG TGCTATGAAG CAGCGTATTA CAGTGACAGT TGACAGCGAC AGCTATCAGT
1861 TGCTCAAGGC ATATATGATG TCAATATCTC CGGTCTGGTA AGCACAACCA TGCAGAATGA
1921 AGCCCGTCGT CTGCGTGCCG AACGCTGGAA AGCGGAAAAT CAGGAAGGGA TGGCTGAGGT
1981 CGCCCGGTTT ATTGAAATGA ACGGCTCTTT TGCTGACGAG AACAGGGACT GGTGAAATGC
2041 AGTTTAAGGT TTACACCTAT AAAAGAGAGA GCCGTTATCG TCTGTTTGTG GATGTACAGA
2101 GTGATATTAT TGACACGCCC GGGCGACGGA TGGTGATCCC CCTGGCCAGT GCACGTCTGC
2161 TGTCAGATAA AGTCTCCCGT GAACTTTACC CGGTGGTGCA TATCGGGGAT GAAAGCTGGC
2221 GCATGATGAC CACCGATATG GCCAGTGTGC CGGTCTCCGT TATCGGGGAA GAAGTGGCTG
2281 ATCTCAGCCA CCGCGAAAAT GACATCAAAA ACGCCATTAA CCTGATGTTC TGGGGAATAT
2341 AAATGTCAGG CTCCCTTATA CACAGCCAGT CTGCAGGTCG ACCATAGTGA CTGGATATGT
2401 TGTGTTTTAC AGTATTATGT AGTCTGTTTT TTATGCAAAA TCTAATTTAA TATATTGATA
2461 TTTATATCAT TTTACGTTTC TCGTTCAGCT TTCTTGTACA AAGTGGTTGA TCGCGTGCAT
2521 GCGACGTCAT AGCTCTCTCC CTATAGTGAG TCGTATTATA AGCTAGGCAC TGGCCGTCGT
2581 TTTACAACGT CGTGACTGGG AAAACTGCTA GCTTGGGATC TTTGTGAAGG AACCTTACTT
2641 CTGTGGTGTG ACATAATTGG ACAAACTACC TACAGAGATT TAAAGCTCTA AGGTAAATAT
2701 AAAATTTTTA AGTGTATAAT GTGTTAAACT AGCTGCATAT GCTTGCTGCT TGAGAGTTTT
2761 GCTTACTGAG TATGATTTAT GAAAATATTA TACACAGGAG CTAGTGATTC TAATTGTTTG
2821 TGTATTTTAG ATTCACAGTC CCAAGGCTCA TTTCAGGCCC CTCAGTCCTC ACAGTCTGTT
2881 CATGATCATA ATCAGCCATA CCACATTTGT AGAGGTTTTA CTTGCTTTAA AAAACCTCCC
2941 ACACCTCCCC CTGAACCTGA AACATAAAAT GAATGCAATT GTTGTTGTTA ACTTGTTTAT
3001 TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA AATTTCACAA ATAAAGCATT
3061 TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC AATGTATCTT ATCATGTCTG
3121 GATCGATCCT GCATTAATGA ATCGGCCAAC GCGCGGGGAG AGGCGGTTTG CGTATTGGCT
3181 GGCGTAATAG CGAAGAGGCC CGCACCGATC GCCCTTCCCA ACAGTTGCGC AGCCTGAATG
3241 GCGAATGGGA CGCGCCCTGT AGCGGCGCAT TAAGCGCGGC GGGTGTGGTG GTTACGCGCA
3301 GCGTGACCGC TACACTTGCC AGCGCCCTAG CGCCCGCTCC TTTCGCTTTC TTCCCTTCCT
3361 TTCTCGCCAC GTTCGCCGGC TTTCCCCGTC AAGCTCTAAA TCGGGGGCTC CCTTTAGGGT
3421 TCCGATTTAG TGCTTTACGG CACCTCGACC CCAAAAAACT TGATTAGGGT GATGGTTCAC
3481 GTAGTGGGCC ATCGCCCTGA TAGACGGTTT TTCGCCCTTT GACGTTGGAG TCCACGTTCT
3541 TTAATAGTGG ACTCTTGTTC CAAACTGGAA CAACACTCAA CCCTATCTCG GTCTATTCTT
3601 TTGATTTATA AGGGATTTTG CCGATTTCGG CCTATTGGTT AAAAAATGAG CTGATTTAAC
3661 AAATATTTAA CGCGAATTTT AACAAAATAT TAACGTTTAC AATTTCGCCT GATGCGGTAT
3721 TTTCTCCTTA CGCATCTGTG CGGTATTTCA CACCGCATAC GCGGATCTGC GCAGCACCAT
3781 GGCCTGAAAT AACCTCTGAA AGAGGAACTT GGTTAGGTAC CTTCTGAGGC GGAAAGAACC
3841 AGCTGTGGAA TGTGTGTCAG TTAGGGTGTG GAAAGTCCCC AGGCTCCCCA GCAGGCAGAA
3901 GTATGCAAAG CATGCATCTC AATTAGTCAG CAACCAGGTG TGGAAAGTCC CCAGGCTCCC
3961 CAGCAGGCAG AAGTATGCAA AGCATGCATC TCAATTAGTC AGCAACCATA GTCCCGCCCC
4021 TAACTCCGCC CATCCCGCCC CTAACTCCGC CCAGTTCCGC CCATTCTCCG CCCCATGGCT
4081 GACTAATTTT TTTTATTTAT GCAGAGGCCG AGGCCGCCTC GGCCTCTGAG CTATTCCAGA
4141 AGTAGTGAGG AGGCTTTTTT GGAGGCCTAG GCTTTTGCAA AAAGCTTGAT TCTTCTGACA
4201 CAACAGTCTC GAACTTAAGG CTAGAGCCAC CATGATTGAA CAAGATGGAT TGCACGCAGG
4261 TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC AGACAATCGG
4321 CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC TTTTTGTCAA
4381 GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC TATCGTGGCT
4441 GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG CGGGAAGGGA
4501 CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC TTGCTCCTGC
4561 CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG ATCCGGCTAC
4621 CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC GGATGGAAGC
4681 CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC CAGCCGAACT
4741 GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGAGGAT CTCGTCGTGA CCCATGGCGA
4801 TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGGATTCA TCGACTGTGG
4861 CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG ATATTGCTGA
4921 AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG CCGCTCCCGA
4981 TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGAGCGG GACTCTGGGG
5041 TTCGAAATGA CCGACCAAGC GACGCCCAAC CTGCCATCAC GATGGCCGCA ATAAAATATC
5101 TTTATTTTCA TTACATCTGT GTGTTGGTTT TTTGTGTGAA TCGATAGCGA TAAGGATCCG
5161 CGTATGGTGC ACTCTCAGTA CAATCTGCTC TGATGCCGCA TAGTTAAGCC AGCCCCGACA
5221 CCCGCCAACA CCCGCTGACG CGCCCTGACG GGCTTGTCTG CTCCCGGCAT CCGCTTACAG
5281 ACAAGCTGTG ACCGTCTCCG GGAGCTGCAT GTGTCAGAGG TTTTCACCGT CATCACCGAA
5341 ACGCGCGAGA CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT
5401 AATGGTTTCT TAGACGTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG
5461 TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT
5521 GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT
5581 TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT
5641 AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG
5701 CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA
5761 AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGGTCG
5821 CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT
5881 TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC
5941 TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA
6001 CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT
6061 ACCAAACGAC GAGCGTGACA CCACGATGCC TGTAGCAATG GCAACAACGT TGCGCAAACT
6121 ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC
6181 GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA
6241 TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG
6301 TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG
6361 AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA
6421 AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA
6481 GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA
6541 CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG
6601 CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA
6661 TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA
6721 TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC
6781 TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG
6841 TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC
6901 GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT
6961 ACAGCGTGAG CATTGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC
7021 GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG
7081 GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG
7141 CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT
7201 GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA
7261 TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG
7321 CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCCAATA CGCAAACCGC CTCTCCCCGC
7381 GCGTTGGCCG ATTCATTAAT GCAGAGCTTG CAATTCGCGC GTTTTTCAAT ATTATTGAAG
7441 CATTTATCAG GGTTATTGTC TCATGAGCGG ATACATATTT GAATGTATTT AGAAAAATAA
7501 ACAAATAGGG GTTCCGCGCA CATTTCCCCG AAAAGTGCCA CCTGACGTCT AAGAAACCAT
7561 TATTATCATG ACATTAACCT ATAAAAATAG GCGTAGTACG AGGCCCTTTC ACTCATTAGA
7621 TGCATGTCGT TACATAACTT ACGGTAAATG GCCCGCCTGG CTGACCGCCC AACGACCCCC
7681 GCCCATTGAC GTCAATAATG ACGTATGTTC CCATAGTAAC GCCAATAGGG ACTTTCCATT
7741 GACGTCAATG GGTGGAGTAT TTACGGTAAA CTGCCCACTT GGCAGTACAT CAAGTGTATC
7801 ATATGCCAAG TACGCCCCCT ATTGACGTCA ATGACGGTAA ATGGCCCGCC TGGCATTATG
7861 CCCAGTACAT GACCTTATGG GACTTTCCTA CTTGGCAGTA CATCTACGTA TTAGTCATCG
7921 CTATTACCAT GGTGATGCGG TTTTGGCAGT ACATCAATGG GCGTGGATAG CGGTTTGACT
7981 CACGGGGATT TCCAAGTCTC CACCCCATTG ACGTCAATGG GAGTTTGTTT TGGCACCAAA
8041 ATCAACGGGA CTTTCCAAAA TGTCGTAACA ACTCCGCCCC ATTGACGCAA ATGGGCGGTA
8101 GGCGTGTACG GTGGGAGGTC TAT
Figure (number illegible): pEXP501: pCMV-SPORT 6 host for attB Libraries

Figure (number illegible): Features of the attB clon[...]. Bases within the hatched [illegible] are replaced by cDNA in some LTI cDNA libraries.

[Figure: double-stranded sequence of the cloning region. The sequence is not legible in this copy.]
pEXP501 4396 bp



1 CCATTCGCCA TTCAGGCTGC GCAACTGTTG GGAAGGGCGA TCGGTGCGGG CCTCTTCGCT
61 ATTACGCCAG CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG
121 GATCGATCCA GACATGATAA GATACATTGA TGAGTTTGGA CAAACCACAA CTAGAATGCA
181 GTGAAAAAAA TGCTTTATTT GTGAAATTTG TGATGCTATT GCTTTATTTG TAACCATTAT
241 AAGCTGCAAT AAACAAGTTA ACAACAACAA TTGCATTCAT TTTATGTTTC AGGTTCAGGG
301 GGAGGTGTGG GAGGTTTTTT AAAGCAAGTA AAACCTCTAC AAATGTGGTA TGGCTGATTA
361 TGATCATGAA CAGACTGTGA GGACTGAGGG GCCTGAAATG AGCCTTGGGA CTGTGAATCT
421 AAAATACACA AACAATTAGA ATCACTAGCT CCTGTGTATA ATATTTTCAT AAATCATACT
481 CAGTAAGCAA AACTCTCAAG CAGCAAGCAT ATGCAGCTAG TTTAACACAT TATACACTTA
541 AAAATTTTAT ATTTACCTTA GAGCTTTAAA TCTCTGTAGG TAGTTTGTCC AATTATGTCA
601 CACCACAGAA GTAAGGTTCC TTCACAAAGA TCCCAAGCTA GCAGTTTTCC CAGTCACGAC
661 GTTGTAAAAC GACGGCCAGT GCCTAGCTTA TAATACGACT CACTATAGGG ACCACTTTGT
721 ACAAGAAAGC TGGGTACGCG TAAGCTTGGG CCCCTCGAGG GATCCTCTAG AGCGGCCGCC
781 GACTAGTGAG CTCGTCGACG ATATCCCGGG AATTCCGGAC CGGTACCAGC CTGCTTTTTT
841 GTACAAACTT GTTCTATAGT GTCACCTAAA TAGGCCTAAT GGTCATAGCT GTTTCCTGTG
901 TGAAATTGTT ATCCGCTCCG CGGCCTAGGC TAGAGTCCGG AGGCTGGATC GGTCCCGGTG
961 TCTTCTATGG AGGTCAAAAC AGCGTGGATG GCGTCTCCAG GCGATCTGAC GGTTCACTAA
1021 ACGAGCTCTG CTTATATAGA CCTCCCACCG TACACGCCTA CCGCCCATTT GCGTCAATGG
1081 GGCGGAGTTG TTACGACATT TTGGAAAGTC CCGTTGATTT TGGTGCCAAA ACAAACTCCC
1141 ATTGACGTCA ATGGGGTGGA GACTTGGAAA TCCCCGTGAG TCAAACCGCT ATCCACGCCC
1201 ATTGATGTAC TGCCAAAACC GCATCACCAT GGTAATAGCG ATGACTAATA CGTAGATGTA
1261 CTGCCAAGTA GGAAAGTCCC ATAAGGTCAT GTACTGGGCA TAATGCCAGG CGGGCCATTT
1321 ACCGTCATTG ACGTCAATAG GGGGCGTACT TGGCATATGA TACACTTGAT GTACTGCCAA
1381 GTGGGCAGTT TACCGTAAAT ACTCCACCCA TTGACGTCAA TGGAAAGTCC CTATTGGCGT
1441 TACTATGGGA ACATACGTCA TTATTGACGT CAATGGGCGG GGGTCGTTGG GCGGTCAGCC
1501 AGGCGGGCCA TTTACCGTAA GTTATGTAAC GACATGCATC TAATGAGTGA AAGGGCCTCG
1561 TACTACGCCT ATTTTTATAG GTTAATGTCA TGATAATAAT GGTTTCTTAG ACGTCAGGTG
1621 GCACTTTTCG GGGAAATGTG CGCGGAACCC CTATTTGTTT ATTTTTCTAA ATACATTCAA
1681 ATATGTATCC GCTCATGAGA CAATAACCCT GATAAATGCT TCAATAATAT TGAAAAACGC
1741 GCGAATTGCA AGCTCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA
1801 TTGGGCGCTC TTCCGCTTCC TCGCTCACTG ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC
1861 GAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG
1921 CAGGAAAGAA CATGTGAGCA AAAGGCCAGC AAAAGGCCAG GAACCGTAAA AAGGCCGCGT
1981 TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA
2041 GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT
2101 CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC
2161 CTTCGGGAAG CGTGGCGCTT TCTCAATGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG
2221 TCGTTCGCTC CAAGCTGGGC TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT
2281 TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA CGACTTATCG CCACTGGCAG
2341 CAGCCACTGG TAACAGGATT AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA
2401 AGTGGTGGCC TAACTACGGC TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA
2461 AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG
2521 GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC AGATTACGCG CAGAAAAAAA GGATCTCAAG
2581 AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG
2641 GGATTTTGGT CATGCCATAA CTTCGTATAG CATACATTAT ACGAAGTTAT GGCATGAGAT
2701 TATCAAAAAG GATCTTCACC TAGATCCTTT TAAATTAAAA ATGAAGTTTT AAATCAATCT
2761 AAAGTATATA TGAGTAAACT TGGTCTGACA GTTACCAATG CTTAATCAGT GAGGCACCTA
2821 TCTCAGCGAT CTGTCTATTT CGTTCATCCA TAGTTGCCTG ACTCCCCGTC GTGTAGATAA
2881 CTACGATACG GGAGGGCTTA CCATCTGGCC CCAGTGCTGC AATGATACCG CGAGACCCAC
2941 GCTCACCGGC TCCAGATTTA TCAGCAATAA ACCAGCCAGC CGGAAGGGCC GAGCGCAGAA
3001 GTGGTCCTGC AACTTTATCC GCCTCCATCC AGTCTATTAA TTGTTGCCGG GAAGCTAGAG
3061 TAAGTAGTTC GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC CATTGCTACA GGCATCGTGG
3121 TGTCACGCTC GTCGTTTGGT ATGGCTTCAT TCAGCTCCGG TTCCCAACGA TCAAGGCGAG
3181 TTACATGATC CCCCATGTTG TGCAAAAAAG CGGTTAGCTC CTTCGGTCCT CCGATCGTTG
3241 TCAGAAGTAA GTTGGCCGCA GTGTTATCAC TCATGGTTAT GGCAGCACTG CATAATTCTC
3301 TTACTGTCAT GCCATCCGTA AGATGCTTTT CTGTGACTGG TGAGTACTCA ACCAAGTCAT
3361 TCTGAGAATA GTGTATGCGG CGACCGAGTT GCTCTTGCCC GGCGTCAATA CGGGATAATA
3421 CCGCGCCACA TAGCAGAACT TTAAAAGTGC TCATCATTGG AAAACGTTCT TCGGGGCGAA
3481 AACTCTCAAG GATCTTACCG CTGTTGAGAT CCAGTTCGAT GTAACCCACT CGTGCACCCA
3541 ACTGATCTTC AGCATCTTTT ACTTTCACCA GCGTTTCTGG GTGAGCAAAA ACAGGAAGGC
3601 AAAATGCCGC AAAAAAGGGA ATAAGGGCGA CACGGAAATG TTGAATACTC ATACTCTTCC
3661 TTTTTCAATA TTATTGAAGC ATTTATCAGG GTTATTGTCT CATGCCAGGG GTGGGCACAC
3721 ATATTTGATA CCAGCGATCC CTACACAGCA CATAATTCAA TGCGACTTCC CTCTATCGCA
3781 CATCTTAGAC CTTTATTCTC CCTCCAGCAC ACATCGAAGC TGCCGAGCAA GCCGTTCTCA
3841 CCAGTCCAAG ACCTGGCATG AGCGGATACA TATTTGAATG TATTTAGAAA AATAAACAAA
3901 TAGGGGTTCC GCGCACATTT CCCCGAAAAG TGCCACCTGA AATTGTAAAC GTTAATATTT
3961 TGTTAAAATT CGCGTTAAAT TTTTGTTAAA TCAGCTCATT TTTTAACCAA TAGGCCGAAA
4021 TCGGCAAAAT CCCTTATAAA TCAAAAGAAT AGACCGAGAT AGGGTTGAGT GTTGTTCCAG
4081 TTTGGAACAA GAGTCCACTA TTAAAGAACG TGGACTCCAA CGTCAAAGGG CGAAAAACCG
4141 TCTATCAGGG CGATGGCCCA CTACGTGAAC CATCACCCTA ATCAAGTTTT TTGGGGTCGA
4201 GGTGCCGTAA AGCACTAAAT CGGAACCCTA AAGGGAGCCC CCGATTTAGA GCTTGACGGG
4261 GAAAGCCGGC GAACGTGGCG AGAAAGGAAG GGAAGAAAGC GAAAGGAGCG GGCGCTAGGG
4321 CGCTGGCAAG TGTAGCGGTC ACGCTGCGCG TAACCACCAC ACCCGCCGCG CTTAATGCGC
4381 CGCTACAGGG CGCGTC
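
Each sequence line above carries the position of its first base followed by blocks of ten bases, so the stated size (4396 bp for pEXP501) can be checked against the listing. A minimal Python sketch of such a check follows. It assumes that any digits on a line form the position label and that stray spaces inside labels or blocks are scanning artifacts to be ignored. `parse_listing` and `start_at` are illustrative names, not part of the original document.

```python
from typing import Iterable


def parse_listing(lines: Iterable[str], start_at: int = 1) -> str:
    """Join numbered sequence lines into one string, checking position labels.

    Digits on a line are read as the position of its first base; letters
    A, C, G and T are read as sequence; everything else is ignored.
    """
    parts = []
    total = start_at - 1  # number of bases preceding the first line
    for raw in lines:
        label = "".join(ch for ch in raw if ch.isdigit())
        bases = "".join(ch for ch in raw.upper() if ch in "ACGT")
        if not bases:
            continue  # blank or non-sequence line
        if label and int(label) != total + 1:
            raise ValueError(
                f"label {label} does not follow {total} bases: {raw!r}"
            )
        parts.append(bases)
        total += len(bases)
    return "".join(parts)


if __name__ == "__main__":
    # Last two lines of the pEXP501 listing, with scanning gaps left in.
    tail = [
        "4 321 CGCTGGCAAG TGTAGCGGTC ACGCTGCGCG TAACCACCAC ACCCGCCGCG CTTAATGCGC",
        "43 81 CGCTACAGGG CGCGTC",
    ]
    seq = parse_listing(tail, start_at=4321)
    print(4320 + len(seq))  # position of the final base
```

The two closing lines contribute 76 bases after position 4320, which agrees with the 4396 bp given in the heading.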
Restriction sites labeled on the plasmid map (enzyme, map position):

Hpa I 1
Apa I 22
EcoO109 I 22
BspM I 620
Bsa I 741
BstX I 757
Mam I 860
Srf I 874
Sma I 875
Bst1107 I 1211
BssH II 1250
BamH I 1291
Sca I 1329
Dsa I 1443
Nco I 1443
Sty I 1443
Gsu I 1643
EcoR I 1744
Kpn2 I 1748
BsaA I 2053
Nru I 2645
EcoN I 2898
Sgf I 2986
Pvu I 2987
Vsp I 3188
Eco57 I 3634
Hae II 3917
BssS I 3988
Drd I 4053
Afl III 4161
BspLU11 I 4161
Nsp I 4161
BsaH I 4298
BsrB I 4346
Bbs I 4416

Other map label: T1-T2
pDONR201 4470 bp (rotated to position 3516)

Location (Base Nos.)    Gene Encoded
260..29                 attP1
656..961                ccdB
1099..1184              ccdA
1303..1962              CmR
2210..2442              attP2
2565..3374              Kmr
3495..4134              ori
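
In the pDONR201 table, attP1 is given as 260..29, with the start greater than the end. Read forward through the origin this would span 4240 bases, whereas read backwards it spans 232 bases, close to the 233 bases listed for attP2 (2210..2442). The Python sketch below compares the two readings and extracts the reverse-complement reading. Treating 260..29 as a complementary-strand feature is an assumption drawn from that length comparison, not something the table states, and the function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def candidate_lengths(start: int, end: int, plasmid_len: int) -> tuple:
    """For a range with start > end, return the two possible lengths:
    (read backwards on the complementary strand, read forward through the origin).
    """
    if start <= end:
        raise ValueError("expected start > end")
    backwards = start - end + 1
    through_origin = (plasmid_len - start + 1) + end
    return backwards, through_origin


def extract_reverse(seq: str, start: int, end: int) -> str:
    """Bases end..start (1-based, inclusive), returned as read on the
    complementary strand. Assumes start > end marks a reverse feature."""
    return reverse_complement(seq[end - 1:start])


if __name__ == "__main__":
    PDONR201_LEN = 4470  # size stated in the table header
    print(candidate_lengths(260, 29, PDONR201_LEN))  # attP1, both readings
    print(extract_reverse("AACGTT", 4, 1))           # toy string, not plasmid data
```

The same comparison applied to a table entry of another plasmid may favor the through-origin reading instead, so the choice is best made per feature.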

1 GTTAACGCTA GCATGGATCT CGGGCCCCAA ATAATGATTT TATTTTGACT GATAGTGACC
61 TGTTCGTTGC AACAAATTGA TGAGCAATGC TTTTTTATAA TGCCAACTTT GTACAAAAAA
121 GCTGAACGAG AAACGTAAAA TGATATAAAT ATCAATATAT TAAATTAGAT TTTGCATAAA
181 AAACAGACTA CATAATACTG TAAAACACAA CATATCCAGT CACTATGAAT CAACTACTTA
241 GATGGTATTA GTGACCTGTA GTCGACCGAC AGCCTTCCAA ATGTTCTTCG GGTGATGCTG
301 CCAACTTAGT CGACCGACAG CCTTCCAAAT GTTCTTCTCA AACGGAATCG TCGTATCCAG
361 CCTACTCGCT ATTGTCCTCA ATGCCGTATT AAATCATAAA AAGAAATAAG AAAAAGAGGT
421 GCGAGCCTCT TTTTTGTGTG ACAAAATAAA AACATCTACC TATTCATATA CGCTAGTGTC
481 ATAGTCCTGA AAATCATCTG CATCAAGAAC AATTTCACAA CTCTTATACT TTTCTCTTAC
541 AAGTCGTTCG GCTTCATCTG GATTTTCAGC CTCTATACTT ACTAAACGTG ATAAAGTTTC
601 TGTAATTTCT ACTGTATCGA CCTGCAGACT GGCTGTGTAT AAGGGAGCCT GACATTTATA
661 TTCCCCAGAA CATCAGGTTA ATGGCGTTTT TGATGTCATT TTCGCGGTGG CTGAGATCAG
721 CCACTTCTTC CCCGATAACG GAGACCGGCA CACTGGCCAT ATCGGTGGTC ATCATGCGCC
781 AGCTTTCATC CCCGATATGC ACCACCGGGT AAAGTTCACG GGAGACTTTA TCTGACAGCA
841 GACGTGCACT GGCCAGGGGG ATCACCATCC GTCGCCCGGG CGTGTCAATA ATATCACTCT
901 GTACATCCAC AAACAGACGA TAACGGCTCT CTCTTTTATA GGTGTAAACC TTAAACTGCA
961 TTTCACCAGT CCCTGTTCTC GTCAGCAAAA GAGCCGTTCA TTTCAATAAA CCGGGCGACC
1021 TCAGCCATCC CTTCCTGATT TTCCGCTTTC CAGCGTTCGG CACGCAGACG ACGGGCTTCA
1081 TTCTGCATGG TTGTGCTTAC CAGACCGGAG ATATTGACAT CATATATGCC TTGAGCAACT
1141 GATAGCTGTC GCTGTCAACT GTCACTGTAA TACGCTGCTT CATAGCACAC CTCTTTTTGA
1201 CATACTTCGG GTATACATAT CAGTATATAT TCTTATACCG CAAAAATCAG CGCGCAAATA
1261 CGCATACTGT TATCTGGCTT TTAGTAAGCC GGATCCACGC GATTACGCCC CGCCCTGCCA
1321 CTCATCGCAG TACTGTTGTA ATTCATTAAG CATTCTGCCG ACATGGAAGC CATCACAGAC
1381 GGCATGATGA ACCTGAATCG CCAGCGGCAT CAGCACCTTG TCGCCTTGCG TATAATATTT
1441 GCCCATGGTG AAAACGGGGG CGAAGAAGTT GTCCATATTG GCCACGTTTA AATCAAAACT
1501 GGTGAAACTC ACCCAGGGAT TGGCTGAGAC GAAAAACATA TTCTCAATAA ACCCTTTAGG
1561 GAAATAGGCC AGGTTTTCAC CGTAACACGC CACATCTTGC GAATATATGT GTAGAAACTG
1621 CCGGAAATCG TCGTGGTATT CACTCCAGAG CGATGAAAAC GTTTCAGTTT GCTCATGGAA
1681 AACGGTGTAA CAAGGGTGAA CACTATCCCA TATCACCAGC TCACCGTCTT TCATTGCCAT
1741 ACGGAATTCC GGATGAGCAT TCATCAGGCG GGCAAGAATG TGAATAAAGG CCGGATAAAA
1801 CTTGTGCTTA TTTTTCTTTA CGGTCTTTAA AAAGGCCGTA ATATCCAGCT GAACGGTCTG
1861 GTTATAGGTA CATTGAGCAA CTGACTGAAA TGCCTCAAAA TGTTCTTTAC GATGCCATTG
1921 GGATATATCA ACGGTGGTAT ATCCAGTGAT TTTTTTCTCC ATTTTAGCTT CCTTAGCTCC
1981 TGAAAATCTC GATAACTCAA AAAATACGCC CGGTAGTGAT CTTATTTCAT TATGGTGAAA
2041 GTTGGAACCT CTTACGTGCC GATCAACGTC TCATTTTCGC CAAAAGTTGG CCCAGGGCTT
2101 CCCGGTATCA ACAGGGACAC CAGGATTTAT TTATTCTGCG AAGTGATCTT CCGTCACAGG
2161 TATTTATTCG GCGCAAAGTG CGTCGGGTGA TGCTGCCAAC TTAGTCGACT ACAGGTCACT
2221 AATACCATCT AAGTAGTTGA TTCATAGTGA CTGGATATGT TGTGTTTTAC AGTATTATGT
2281 AGTCTGTTTT TTATGCAAAA TCTAATTTAA TATATTGATA TTTATATCAT TTTACGTTTC
2341 TCGTTCAGCT TTCTTGTACA AAGTTGGCAT TATAAGAAAG CATTGCTTAT CAATTTGTTG
2401 CAACGAACAG GTCACTATCA GTCAAAATAA AATCATTATT TGCCATCCAG CTGCAGCTCT
2461 GGCCCGTGTC TCAAAATCTC TGATGTTACA TTGCACAAGA TAAAAATATA TCATCATGAA
2521 CAATAAAACT GTCTGCTTAC ATAAACAGTA ATACAAGGGG TGTTATGAGC CATATTCAAC
2581 GGGAAACGTC GAGGCCGCGA TTAAATTCCA ACATGGATGC TGATTTATAT GGGTATAAAT
2641 GGGCTCGCGA TAATGTCGGG CAATCAGGTG CGACAATCTA TCGCTTGTAT GGGAAGCCCG
2701 ATGCGCCAGA GTTGTTTCTG AAACATGGCA AAGGTAGCGT TGCCAATGAT GTTACAGATG
2761 AGATGGTCAG ACTAAACTGG CTGACGGAAT TTATGCCTCT TCCGACCATC AAGCATTTTA
2821 TCCGTACTCC TGATGATGCA TGGTTACTCA CCACTGCGAT CCCCGGAAAA ACAGCATTCC
2881 AGGTATTAGA AGAATATCCT GATTCAGGTG AAAATATTGT TGATGCGCTG GCAGTGTTCC
2941 TGCGCCGGTT GCATTCGATT CCTGTTTGTA ATTGTCCTTT TAACAGCGAT CGCGTATTTC
3001 GTCTCGCTCA GGCGCAATCA CGAATGAATA ACGGTTTGGT TGATGCGAGT GATTTTGATG
3061 ACGAGCGTAA TGGCTGGCCT GTTGAACAAG TCTGGAAAGA AATGCATAAA CTTTTGCCAT
3121 TCTCACCGGA TTCAGTCGTC ACTCATGGTG ATTTCTCACT TGATAACCTT ATTTTTGACG
3181 AGGGGAAATT AATAGGTTGT ATTGATGTTG GACGAGTCGG AATCGCAGAC CGATACCAGG
3241 ATCTTGCCAT CCTATGGAAC TGCCTCGGTG AGTTTTCTCC TTCATTACAG AAACGGCTTT
3301 TTCAAAAATA TGGTATTGAT AATCCTGATA TGAATAAATT GCAGTTTCAT TTGATGCTCG
3361 ATGAGTTTTT CTAATCAGAA TTGGTTAATT GGTTGTAACA CTGGCAGAGC ATTACGCTGA
3421 CTTGACGGGA CGGCGCAAGC TCATGACCAA AATCCCTTAA CGTGAGTTTT CGTTCCACTG
3481 AGCGTCAGAC CCCGTAGAAA AGATCAAAGG ATCTTCTTGA GATCCTTTTT TTCTGCGCGT
3541 AATCTGCTGC TTGCAAACAA AAAAACCACC GCTACCAGCG GTGGTTTGTT TGCCGGATCA
3601 AGAGCTACCA ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC AGAGCGCAGA TACCAAATAC
3661 TGTCCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG AACTCTGTAG CACCGCCTAC
3721 ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC AGTGGCGATA AGTCGTGTCT
3781 TACCGGGTTG GACTCAAGAC GATAGTTACC GGATAAGGCG CAGCGGTCGG GCTGAACGGG
3841 GGGTTCGTGC ACACAGCCCA GCTTGGAGCG AACGACCTAC ACCGAACTGA GATACCTACA
3901 GCGTGAGCTA TGAGAAAGCG CCACGCTTCC CGAAGGGAGA AAGGCGGACA GGTATCCGGT
3961 AAGCGGCAGG GTCGGAACAG GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA
4021 TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG CGTCGATTTT TGTGATGCTC
4081 GTCAGGGGGG CGGAGCCTAT GGAAAAACGC CAGCAACGCG GCCTTTTTAC GGTTCCTGGC
4141 CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TCCTGCGTTA TCCCCTGATT CTGTGGATAA
4201 CCGTATTACC GCTAGCCAGG AAGAGTTTGT AGAAACGCAA AAAGGCCATC CGTCAGGATG
4261 GCCTTCTGCT TAGTTTGATG CCTGGCAGTT TATGGCGGGC GTCCTGCCCG CCACCCTCCG
4321 GGCCGTTGCT TCACAACGTT CAAATCCGCT CCCGGCGGAT TTGTCCTACT CAGGAGAGCG
4381 TTCACCGACA AACAACAGAT AAAACGAAAG GCCCAGTCTT CCGACTGAGC CTTTCGTTTT
4441 ATTTGATGCC TGGCAGTTCC CTACTCTCGC
pDONR202 4204 bp

Location (Base Nos.)    Gene Encoded

369..127                attP1
486..1059               ori
1228..2107              KmR
2381..2140              attP2
2629..3288              CmR
3408..3492              inactivated ccdA
3630..3935              ccdB



   1 CGGCATTGAG GACAATAGCG AGTAGGCTGG ATACGACGAT TCCGTTTGAG AAGAACATTT
  61 GGAAGGCTGT CGGTCGACTA AGTTGGCAGC ATCACCCGAA GAACATTTGG AAGGCTGTCG
 121 GTCGACTACA GGTCACTAAT ACCATCTAAG TAGTTGATTC ATAGTGACTG GATATGTTGT
 181 GTTTTACAGT ATTATGTAGT CTGTTTTTTA TGCAAAATCT AATTTAATAT ATTGATATTT
 241 ATATCATTTT ACGTTTCTCG TTCAGCTTTT TTGTACAAAG TTGGCATTAT AAAAAAGCAT
 301 TGCTCATCAA TTTGTTGCAA CGAACAGGTC ACTATCAGTC AAAATAAAAT CATTATTTGG
 361 GGCCCGAGAT CCATGCTAGC GGTAATACGG TTATCCACAG AATCAGGGGA TAACGCAGGA
 421 AAGAACATGT GAGCAAAAGG CCAGCAAAAG GCCAGGAACC GTAAAAAGGC CGCGTTGCTG
 481 GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC GAGCATCACA AAAATCGACG CTCAAGTCAG
 541 AGGTGGCGAA ACCCGACAGG ACTATAAAGA TACCAGGCGT TTCCCCCTGG AAGCTCCCTC
 601 GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT ACCGGATACC TGTCCGCCTT TCTCCCTTCG
 661 GGAAGCGTGG CGCTTTCTCA TAGCTCACGC TGTAGGTATC TCAGTTCGGT GTAGGTCGTT
 721 CGCTCCAAGC TGGGCTGTGT GCACGAACCC CCCGTTCAGC CCGACCGCTG CGCCTTATCC
 781 GGTAACTATC GTCTTGAGTC CAACCCGGTA AGACACGACT TATCGCCACT GGCAGCAGCC
 841 ACTGGTAACA GGATTAGCAG AGCGAGGTAT GTAGGCGGTG CTACAGAGTT CTTGAAGTGG
 901 TGGCCTAACT ACGGCTACAC TAGAAGGACA GTATTTGGTA TCTGCGCTCT GCTGAAGCCA
 961 GTTACCTTCG GAAAAAGAGT TGGTAGCTCT TGATCCGGCA AACAAACCAC CGCTGGTAGC
1021 GGTGGTTTTT TTGTTTGCAA GCAGCAGATT ACGCGCAGAA AAAAAGGATC TCAAGAAGAT
1081 CCTTTGATCT TTTCTACGGG GTCTGACGCT CAGTGGAACG AAAACTCACG TTAAGGGATT
1141 TTGGTCATGA GCTTGCGCCG TCCCGTCAAG TCAGCGTAAT GCTCTGCCAG TGTTACAACC
1201 AATTAACCAA TTCTGATTAG AAAAACTCAT CGAGCATCAA ATGAAACTGC AATTTATTCA
1261 TATCAGGATT ATCAATACCA TATTTTTGAA AAAGCCGTTT CTGTAATGAA GGAGAAAACT
1321 CACCGAGGCA GTTCCATAGG ATGGCAAGAT CCTGGTATCG GTCTGCGATT CCGACTCGTC
1381 CAACATCAAT ACAACCTATT AATTTCCCCT CGTCAAAAAT AAGGTTATCA AGTGAGAAAT
1441 CACCATGAGT GACGACTGAA TCCGGTGAGA ATGGCAAAAG TTTATGCATT TCTTTCCAGA
1501 CTTGTTCAAC AGGCCAGCCA TTACGCTCGT CATCAAAATC ACTCGCATCA ACCAAACCGT
1561 TATTCATTCG TGATTGCGCC TGAGCGAGAC GAAATACGCG ATCGCTGTTA AAAGGACAAT
1621 TACAAACAGG AATCGAATGC AACCGGCGCA GGAACACTGC CAGCGCATCA ACAATATTTT
1681 CACCTGAATC AGGATATTCT TCTAATACCT GGAATGCTGT TTTTCCGGGG ATCGCAGTGG
1741 TGAGTAACCA TGCATCATCA GGAGTACGGA TAAAATGCTT GATGGTCGGA AGAGGCATAA
1801 ATTCCGTCAG CCAGTTTAGT CTGACCATCT CATCTGTAAC ATCATTGGCA ACGCTACCTT
1861 TGCCATGTTT CAGAAACAAC TCTGGCGCAT CGGGCTTCCC ATACAAGCGA TAGATTGTCG
1921 CACCTGATTG CCCGACATTA TCGCGAGCCC ATTTATACCC ATATAAATCA GCATCCATGT
1981 TGGAATTTAA TCGCGGCCTC GACGTTTCCC GTTGAATATG GCTCATAACA CCCCTTGTAT
2041 TACTGTTTAT GTAAGCAGAC AGTTTTATTG TTCATGATGA TATATTTTTA TCTTGTGCAA
2101 TGTAACATCA GAGATTTTGA GACACGGGCC AGAGCTGCAG CTGGATGGCA AATAATGATT
2161 TTATTTTGAC TGATAGTGAC CTGTTCGTTG CAACAAATTG ATAAGCAATG CTTTCTTATA
2221 ATGCCAACTT TGTACAAGAA AGCTGAACGA GAAACGTAAA ATGATATAAA TATCAATATA
2281 TTAAATTAGA TTTTGCATAA AAAACAGACT ACATAATACT GTAAAACACA ACATATCCAG
2341 TCACTATGAA TCAACTACTT AGATGGTATT AGTGACCTGT AGTCGACTAA GTTGGCAGCA
2401 TCACCCGACG CACTTTGCGC CGAATAAATA CCTGTGACGG AAGATCACTT CGCAGAATAA
2461 ATAAATCCTG GTGTCCCTGT TGATACCGGG AAGCCCTGGG CCAACTTTTG GCGAAAATGA
2521 GACGTTGATC GGCACGTAAG AGGTTCCAAC TTTCACCATA ATGAAATAAG ATCACTACCG
2581 GGCGTATTTT TTGAGTTATC GAGATTTTCA GGAGCTAAGG AAGCTAAAAT GGAGAAAAAA
2641 ATCACTGGAT ATACCACCGT TGATATATCC CAATGGCATC GTAAAGAACA TTTTGAGGCA
2701 TTTCAGTCAG TTGCTCAATG TACCTATAAC CAGACCGTTC AGCTGGATAT TACGGCCTTT
2761 TTAAAGACCG TAAAGAAAAA TAAGCACAAG TTTTATCCGG CCTTTATTCA CATTCTTGCC
2821 CGCCTGATGA ATGCTCATCC GGAATTCCGT ATGGCAATGA AAGACGGTGA GCTGGTGATA
2881 TGGGATAGTG TTCACCCTTG TTACACCGTT TTCCATGAGC AAACTGAAAC GTTTTCATCG
2941 CTCTGGAGTG AATACCACGA CGATTTCCGG CAGTTTCTAC ACATATATTC GCAAGATGTG
3001 GCGTGTTACG GTGAAAACCT GGCCTATTTC CCTAAAGGGT TTATTGAGAA TATGTTTTTC
3061 GTCTCAGCCA ATCCCTGGGT GAGTTTCACC AGTTTTGATT TAAACGTGGC CAATATGGAC
3121 AACTTCTTCG CCCCCGTTTT CACCATGGGC AAATATTATA CGCAAGGCGA CAAGGTGCTG
3181 ATGCCGCTGG CGATTCAGGT TCATCATGCC GTCTGTGATG GCTTCCATGT CGGCAGAATG
3241 CTTAATGAAT TACAACAGTA CTGCGATGAG TGGCAGGGCG GGGCGTAATC GCGTGGATCC
3301 GGCTTACTAA AAGCCAGATA ACAGTATGCG TATTTGCGCG CTGATTTTTG CGGTATAAGA
3361 ATATATACTG ATATGTATAC CCGAAGTATG TCAAAAAGAG GTGTGCTATG AAGCAGCGTA
3421 TTACAGTGAC AGTTGACAGC GACAGCTATC AGTTGCTCAA GGCATATATG ATGTCAATAT
3481 CTCCGGTCTG GTAAGCACAA CCATGCAGAA TGAAGCCCGT CGTCTGCGTG CCGAACGCTG
3541 GAAAGCGGAA AATCAGGAAG GGATGGCTGA GGTCGCCCGG TTTATTGAAA TGAACGGCTC
3601 TTTTGCTGAC GAGAACAGGG ACTGGTGAAA TGCAGTTTAA GGTTTACACC TATAAAAGAG
3661 AGAGCCGTTA TCGTCTGTTT GTGGATGTAC AGAGTGATAT TATTGACACG CCCGGGCGAC
3721 GGATGGTGAT CCCCCTGGCC AGTGCACGTC TGCTGTCAGA TAAAGTCTCC CGTGAACTTT
3781 ACCCGGTGGT GCATATCGGG GATGAAAGCT GGCGCATGAT GACCACCGAT ATGGCCAGTG
3841 TGCCGGTCTC CGTTATCGGG GAAGAAGTGG CTGATCTCAG CCACCGCGAA AATGACATCA
3901 AAAACGCCAT TAACCTGATG TTCTGGGGAA TATAAATGTC AGGCTCCCTT ATACACAGCC
3961 AGTCTGCAGG TCGATACAGT AGAAATTACA GAAACTTTAT CACGTTTAGT AAGTATAGAG
4021 GCTGAAAATC CAGATGAAGC CGAACGACTT GTAAGAGAAA AGTATAAGAG TTGTGAAATT
4081 GTTCTTGATG CAGATGATTT TCAGGACTAT GACACTAGCG TATATGAATA GGTAGATGTT
4141 TTTATTTTGT CACACAAAAA AGAGGCTCGC ACCTCTTTTT CTTATTTCTT TTTATGATTT
4201 AATA
pDONR203 4208 bp

Location (Base Nos.)    Gene Encoded

47..131                 inactivated ccdA
251..910                CmR
1158..1398              attP2
1509..2082              ori
2251..3130              KmR
3464..3174              attP1
3812..4117              ccdB
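
The feature tables give each element as a pair of base numbers, and some pairs run from a higher to a lower number (for example 3464..3174). A minimal Python sketch of reading such entries follows. It assumes, as is conventional but not stated in this document, that a descending pair denotes a feature on the complementary strand; the function name `parse_location` and the dictionary layout are illustrative only.

```python
def parse_location(text):
    """Parse a 'start..end' entry into (low, high, strand, length_bp)."""
    # Tolerate OCR-style spacing such as '47 . . 131'.
    start_s, end_s = text.replace(" ", "").split("..")
    start, end = int(start_s), int(end_s)
    # Assumption: descending coordinates mean the complementary strand.
    strand = "+" if start <= end else "-"
    low, high = min(start, end), max(start, end)
    return low, high, strand, high - low + 1


# Entries as listed in the pDONR203 feature table.
PDONR203_FEATURES = {
    "inactivated ccdA": "47..131",
    "CmR": "251..910",
    "attP2": "1158..1398",
    "ori": "1509..2082",
    "KmR": "2251..3130",
    "attP1": "3464..3174",
    "ccdB": "3812..4117",
}

if __name__ == "__main__":
    for name, loc in PDONR203_FEATURES.items():
        low, high, strand, length = parse_location(loc)
        print(f"{name:18s} {low:5d}-{high:<5d} strand {strand}  {length:4d} bp")
```

Comparing the computed lengths between plasmids is a quick consistency check on a table, since the same element should have the same length wherever it appears.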



   1 GCGTTCGGCA CGCAGACGAC GGGCTTCATT CTGCATGGTT GTGCTTACCA GACCGGAGAT
  61 ATTGACATCA TATATGCCTT GAGCAACTGA TAGCTGTCGC TGTCAACTGT CACTGTAATA
 121 CGCTGCTTCA TAGCACACCT CTTTTTGACA TACTTCGGGT ATACATATCA GTATATATTC
 181 TTATACCGCA AAAATCAGCG CGCAAATACG CATACTGTTA TCTGGCTTTT AGTAAGCCGG
 241 ATCCACGCGT TTACGCCCCG CCCTGCCACT CATCGCAGTA CTGTTGTAAT TCATTAAGCA
 301 TTCTGCCGAC ATGGAAGCCA TCACAGACGG CATGATGAAC CTGAATCGCC AGCGGCATCA
 361 GCACCTTGTC GCCTTGCGTA TAATATTTGC CCATGGTGAA AACGGGGGCG AAGAAGTTGT
 421 CCATATTGGC CACGTTTAAA TCAAAACTGG TGAAACTCAC CCAGGGATTG GCTGAGACGA
 481 AAAACATATT CTCAATAAAC CCTTTAGGGA AATAGGCCAG GTTTTCACCG TAACACGCCA
 541 CATCTTGCGA ATATATGTGT AGAAACTGCC GGAAATCGTC GTGGTATTCA CTCCAGAGCG
 601 ATGAAAACGT TTCAGTTTGC TCATGGAAAA CGGTGTAACA AGGGTGAACA CTATCCCATA
 661 TCACCAGCTC ACCGTCTTTC ATTGCCATAC GGAATTCCGG ATGAGCATTC ATCAGGCGGG
 721 CAAGAATGTG AATAAAGGCC GGATAAAACT TGTGCTTATT TTTCTTTACG GTCTTTAAAA
 781 AGGCCGTAAT ATCCAGCTGA ACGGTCTGGT TATAGGTACA TTGAGCAACT GACTGAAATG
 841 CCTCAAAATG TTCTTTACGA TGCCATTGGG ATATATCAAC GGTGGTATAT CCAGTGATTT
 901 TTTTCTCCAT TTTAGCTTCC TTAGCTCCTG AAAATCTCGA TAACTCAAAA AATACGCCCG
 961 GTAGTGATCT TATTTCATTA TGGTGAAAGT TGGAACCTCT TACGTGCCGA TCAACGTCTC
1021 ATTTTCGCCA AAAGTTGGCC CAGGGCTTCC CGGTATCAAC AGGGACACCA GGATTTATTT
1081 ATTCTGCGAA GTGATCTTCC GTCACAGGTA TTTATTCGGC GCAAAGTGCG TCGGGTGATG
1141 CTGCCAACTT AGTCGACTAC AGGTCACTAA TACCATCTAA GTAGTTGATT CATAGTGACT
1201 GGATATGTTG TGTTTTACAG TATTATGTAG TCTGTTTTTT ATGCAAAATC TAATTTAATA
1261 TATTGATATT TATATCATTT TACGTTTCTC GTTCAGCTTT CTTGTACAAA GTTGGCATTA
1321 TAAGAAAGCA TTGCTTATCA ATTTGTTGCA ACGAACAGGT CACTATCAGT CAAAATAAAA
1381 TCATTATTTG CCATCCAGCT AGCGGTAATA CGGTTATCCA CAGAATCAGG GGATAACGCA
1441 GGAAAGAACA TGTGAGCAAA AGGCCAGCAA AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG
1501 CTGGCGTTTT TCCATAGGCT CCGCCCCCCT GACGAGCATC ACAAAAATCG ACGCTCAAGT
1561 CAGAGGTGGC GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC TGGAAGCTCC
1621 CTCGTGCGCT CTCCTGTTCC GACCCTGCCG CTTACCGGAT ACCTGTCCGC CTTTCTCCCT
1681 TCGGGAAGCG TGGCGCTTTC TCATAGCTCA CGCTGTAGGT ATCTCAGTTC GGTGTAGGTC
1741 GTTCGCTCCA AGCTGGGCTG TGTGCACGAA CCCCCCGTTC AGCCCGACCG CTGCGCCTTA
1801 TCCGGTAACT ATCGTCTTGA GTCCAACCCG GTAAGACACG ACTTATCGCC ACTGGCAGCA
1861 GCCACTGGTA ACAGGATTAG CAGAGCGAGG TATGTAGGCG GTGCTACAGA GTTCTTGAAG
1921 TGGTGGCCTA ACTACGGCTA CACTAGAAGA ACAGTATTTG GTATCTGCGC TCTGCTGAAG
1981 CCAGTTACCT TCGGAAAAAG AGTTGGTAGC TCTTGATCCG GCAAACAAAC CACCGCTGGT
2041 AGCGGTGGTT TTTTTGTTTG CAAGCAGCAG ATTACGCGCA GAAAAAAAGG ATCTCAAGAA
2101 GATCCTTTGA TCTTTTCTAC GGGGTCTGAC GCTCAGTGGA ACGAAAACTC ACGTTAAGGG
2161 ATTTTGGTCA TGAGCTTGCG CCGTCCCGTC AAGTCAGCGT AATGCTCTGC CAGTGTTACA
2221 ACCAATTAAC CAATTCTGAT TAGAAAAACT CATCGAGCAT CAAATGAAAC TGCAATTTAT
2281 TCATATCAGG ATTATCAATA CCATATTTTT GAAAAAGCCG TTTCTGTAAT GAAGGAGAAA
2341 ACTCACCGAG GCAGTTCCAT AGGATGGCAA GATCCTGGTA TCGGTCTGCG ATTCCGACTC
2401 GTCCAACATC AATACAACCT ATTAATTTCC CCTCGTCAAA AATAAGGTTA TCAAGTGAGA
2461 AATCACCATG AGTGACGACT GAATCCGGTG AGAATGGCAA AAGTTTATGC ATTTCTTTCC
2521 AGACTTGTTC AACAGGCCAG CCATTACGCT CGTCATCAAA ATCACTCGCA TCAACCAAAC
2581 CGTTATTCAT TCGTGATTGC GCCTGAGCGA GACGAAATAC GCGATCGCTG TTAAAAGGAC
2641 AATTACAAAC AGGAATCGAA TGCAACCGGC GCAGGAACAC TGCCAGCGCA TCAACAATAT
2701 TTTCACCTGA ATCAGGATAT TCTTCTAATA CCTGGAATGC TGTTTTTCCG GGGATCGCAG
2761 TGGTGAGTAA CCATGCATCA TCAGGAGTAC GGATAAAATG CTTGATGGTC GGAAGAGGCA
2821 TAAATTCCGT CAGCCAGTTT AGTCTGACCA TCTCATCTGT AACATCATTG GCAACGCTAC
2881 CTTTGCCATG TTTCAGAAAC AACTCTGGCG CATCGGGCTT CCCATACAAG CGATAGATTG
2941 TCGCACCTGA TTGCCCGACA TTATCGCGAG CCCATTTATA CCCATATAAA TCAGCATCCA
3001 TGTTGGAATT TAATCGCGGC CTCGACGTTT CCCGTTGAAT ATGGCTCATA ACACCCCTTG
3061 TATTACTGTT TATGTAAGCA GACAGTTTTA TTGTTCATGA TGATATATTT TTATCTTGTG
3121 CAATGTAACA TCAGAGATTT TGAGACACGG GCCAGAGCTG CAGCTAGCAT GGATCTCGGG
3181 CCCCAAATAA TGATTTTATT TTGACTGATA GTGACCTGTT CGTTGCAACA AATTGATGAG
3241 CAATGCTTTT TTATAATGCC AACTTTGTAC AAAAAAGCTG AACGAGAAAC GTAAAATGAT
3301 ATAAATATCA ATATATTAAA TTAGATTTTG CATAAAAAAC AGACTACATA ATACTGTAAA
3361 ACACAACATA TCCAGTCACT ATGAATCAAC TACTTAGATG GTATTAGTGA CCTGTAGTCG
3421 ACCGACAGCC TTCCAAATGT TCTTCGGGTG ATGCTGCCAA CTTAGTCGAC CGACAGCCTT
3481 CCAAATGTTC TTCTCAAACG GAATCGTCGT ATCCAGCCTA CTCGCTATTG TCCTCAATGC
3541 CGTATTAAAT CATAAAAAGA AATAAGAAAA AGAGGTGCGA GCCTCTTTTT TGTGTGACAA
3601 AATAAAAACA TCTACCTATT CATATACGCT AGTGTCATAG TCCTGAAAAT CATCTGCATC
3661 AAGAACAATT TCACAACTCT TATACTTTTC TCTTACAAGT CGTTCGGCTT CATCTGGATT
3721 TTCAGCCTCT ATACTTACTA AACGTGATAA AGTTTCTGTA ATTTCTACTG TATCGACCTG
3781 CAGACTGGCT GTGTATAAGG GAGCCTGACA TTTATATTCC CCAGAACATC AGGTTAATGG
3841 CGTTTTTGAT GTCATTTTCG CGGTGGCTGA GATCAGCCAC TTCTTCCCCG ATAACGGAGA
3901 CCGGCACACT GGCCATATCG GTGGTCATCA TGCGCCAGCT TTCATCCCCG ATATGCACCA
3961 CCGGGTAAAG TTCACGGGAG ACTTTATCTG ACAGCAGACG TGCACTGGCC AGGGGGATCA
4021 CCATCCGTCG CCCGGGCGTG TCAATAATAT CACTCTGTAC ATCCACAAAC AGACGATAAC
4081 GGCTCTCTCT TTTATAGGTG TAAACCTTAA ACTGCATTTC ACCAGTCCCT GTTCTCGTCA
4141 GCAAAAGAGC CGTTCATTTC AATAAACCGG GCGACCTCAG CCATCCCTTC CTGATTTTCC
4201 GCTTTCCA
   1 CGGCATTGAG GACAATAGCG AGTAGGCTGG ATACGACGAT TCCGTTTGAG AAGAACATTT
  61 GGAAGGCTGT CGGTCGACTA CAGGTCACTA ATACCATCTA AGTAGTTGAA TCATAGTGAC
 121 TGGATATGTT GTGTTTTACA GTATTATGTA GTCTGTTTTT TATGCAAAAT CTAATTTAAT
 181 ATATTGATAT TTATATCATT TTACGTTTCT CGTTCAGCTT TTTTGTACAA AGTTGGCATT
 241 ATAAAAAAGC ATTGCTTATC AATTTGTTGC AACGAACAGG TCACTATCAG TCAAAATAAA
 301 ATCATTATTT GGGGCCCGAG ATCCATGCTA GCTGCAGTGC GCAGGGCCCG TGTCTCAAAA
 361 TCTCTGATGT TACATTGCAC AAGATAAAAA TATATCATCA TGAACAATAA AACTGTCTGC
 421 TTACATAAAC AGTAATACAA GGGGTGTTAT GAGCCATATT CAACGGGAAA CGTCTTGCTG
 481 GAGGCCGCGA TTAAATTCCA ACATGGATGC TGATTTATAT GGGTATAAAT GGGCTCGCGA
 541 TAATGTCGGG CAATCAGGTG CGACAATCTT TCGATTGTAT GGGAAGCCCG ATGCGCCAGA
 601 GTTGTTTCTG AAACATGGCA AAGGTAGCGT TGCCAATGAT GTTACAGATG AGATGGTCAG
 661 ACTAAACTGG CTGACGGAAT TTATGCCTCT TCCGACCATC AAGCATTTTA TCCGTACTCC
 721 TGATGATGCA TGGTTACTCA CCACTGCGAT CCGCGGGAAA ACAGCATTCC AGGTATTAGA
 781 AGAATATCCT GATTCAGGTG AAAATATTGT TGATGCGCTG GCAGTGTTCC TGCGCCGGTT
 841 GCATTCGATT CCTGTTTGTA ATTGTCCTTT TAACAGCGAT CGCGTATTTC GTCTCGCTCA
 901 GGCGCAATCA CGAATGAATA ACGGTTTGGT TGATGCGAGT GATTTTGATG ACGAGCGTAA
 961 TGGCTGGCCT GTTGAACAAG TCTGGAAAGA AATGCATACG CTTTTGCCAT TCTCACCGGA
1021 TTCAGTCGTC ACTCATGGTG ATTTCTCACT TGATAACCTT ATTTTTGACG AGGGGAAATT
1081 AATAGGTTGT ATTGATGTTG GACGAGTCGG AATCGCAGAC CGATACCAGG ATCTTGCCAT
1141 CCTATGGAAC TGCCTCGGTG AGTTTTCTCC TTCATTACAG AAACGGCTTT TTCAAAAATA
1201 TGGTATTGAT AATCCTGATA TGAATAAATT GCAGTTTCAT TTGATGCTCG ATGAGTTTTT
1261 CTAATCAGAA TTGGTTAATT GGTTGTAACA CTGGCAGAGC ATTACGCTGA CTTGACGGGA
1321 CGGCGNCATG ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT CAGACCCCGT
1381 AGAAAAGATC AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA
1441 AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC TACCAACTCT
1501 TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTCC TTCTAGTGTA
1561 GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT
1621 AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG GGTTGGACTC
1681 AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA
1741 GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG AGCTATGAGA
1801 AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG
1861 AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT
1921 CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG
1981 CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT
2041 TGCTCACATG TTCTTTCCTG CGTTATCCCC TGATTCTGTG GATAACCGTA TTACCGCTAG
2101 CTGGATCGGC AAATAATGAT TTTATTTTGA CTGATAGTGA CCTGTTCGTT GCAACAAATT
2161 GATAAGCAAT GCTTTTTTAT AATGCCAACT TTGTACAAGA AAGCTGAACG AGAAACGTAA
2221 AATGATATAA ATATCAATAT ATTAAATTAG ATTTTGCATA AAAAACAGAC TACATAATAC
2281 TGTAAAACAC AACATATCCA GTCACTATGA TTCAACTACT TAGATGGTAT TAGTGACCTG
2341 TAGTCGACTA AGTTGGCAGC ATCACCCGAC GCACTTTGCG CCGAATAAAT ACCTGTGACG
2401 GAAGATCACT TCGCAGAATA AATAAATCCT GGTGTCCCTG TTGATACCGG GAAGCCCTGG
2461 GCCAACTTTT GGCGAAAATG AGACGTTGAT CGGCACATTT CACAACTCTT ATACTTTTCT
2521 CTTACAAGTC GTTCGGCTTC ATCTGGATTT TCAGCCTCTA TACTTACTAA ACGTGATAAA
2581 GTTTCTGTAA TTTCTACTGT ATCGACCTGC AGACTGGCTG TGTATAACGG AGCCTGACAT
2641 TTATATTCCC CAGAACATCA GGTTAATGGC GTTTTTGATG TCATTTTCGC GGTGGCTGAG
2701 ATCAGCCACT TCTTCCCCGA TAACGGAGAC CGGCACACTG GCCATATCGG TGGTCATCAT
2761 GCGCCAGCTT TCATCCCCGA TATGCACCAC CGGGTAAAGT TCACGGGAGA CTTTATCTGA
2821 CAGCAGACGT GCACTGGCCA GGGGGATCAC CATCCGTCGC CCGGGCGTGT CAATAATATC
2881 ACTCTGTACA TCCACAAACA GACGATAACG GCTCTCTCTT TTATAGGTGT AAACCTTAAA
2941 CTGCATTTCA CCAGTCCCTG TTCTCGTCAG CAAAAGAGCC GTTCATTTCA ATAAACCGGG
3001 CGACCTCAGC CATCCCTTCC TGATTTTCCG CTTTCCAGCG TTCGGCACGC AGACGACGGG
3061 CTTCATTCTG CATGGTTGTG CTTACCAGAC CGGAGATATT GACATCATAT ATGCCTTGAG
3121 CAACTGATAG CTGTCGCTGT CAACTGTCAC TGTAATACGC TGCTTCATAG CACACCTCTT
3181 TTTGACATAC TTCGGGTATA CATATCAGTA TATATTCTTA TACCGCAAAA ATCAGCGCGC
3241 AAATACGCAT ACTGTTATCT GGCTTTTAGT AAGCCGGATC CACGCGTTTA CGCCCCGCCC
3301 TGCCACTCAT CGCAGTACTG TTGTAATTCA TTAAGCATTC TGCCGACATG GAAGCCATCA
3361 CAGACGGCAT GATGAACCTG AATCGCCAGC GGCATCAGCA CCTTGTCGCC TTGCGTATAA
3421 TATTTGCCCA TGGTGAAAAC GGGGGCGAAG AAGTTGTCCA TATTGGCCAC GTTTAAATCA
3481 AAACTGGTGA AACTCACCCA GGGATTGGCT GAGACGAAAA ACATATTCTC AATAAACCCT
3541 TTAGGGAAAT AGGCCAGGTT TTCACCGTAA CACGCCACAT CTTGCGAATA TATGTGTAGA
3601 AACTGCCGGA AATCGTCGTG GTATTCACTC CAGAGCGATG AAAACGTTTC AGTTTGCTCA
3661 TGGAAAACGG TGTAACAAGG GTGAACACTA TCCCATATCA CCAGCTCACC GTCTTTCATT
3721 GCCATACGGA ATTCCGGATG AGCATTCATC AGGCGGGCAA GAATGTGAAT AAAGGCCGGA
3781 TAAAACTTGT GCTTATTTTT CTTTACGGTC TTTAAAAAGG CCGTAATATC CAGCTGAACG
3841 GTCTGGTTAT AGGTACATTG AGCAACTGAC TGAAATGCCT CAAAATGTTC TTTACGATGC
3901 CATTGGGATA TATCAACGGT GGTATATCCA GTGATTTTTT TCTCCATTTT AGCTTCCTTA
3961 GCTCCTGAAA ATCTCGATAA CTCAAAAAAT ACGCCCGGTA GTGATCTTAT TTCATTATGG
4021 TGAAAGTTGG AACCTCTTAC TGTTCTTGAT GCAGATGATT TTCAGGACTA TGACACTAGC
4081 ATATATGAAT AGGTAGATGT TTTTATTTTG TCACACAAAA AAGAGGCTCG CACCTCTTTT
4141 TCTTATTTCT TTTTATGATT TAATA
pDONR205 4939 bp



GGCATCAGCACCTTGTCGCCTTGCGTATAATATTTGCCCATGGTGAAAACGGGGGCGAAG 
AAGTTGTCCATATTGGCCACGTTTAAATCAAAACTGGTGAAACTCACCCAGGGATTGGCT 
GAGACGAAAAACATATTCTCAATAAACCCTTTAGGGAAATAGGCCAGGTTTTCACCGTAA 
CACGCCACATCTTGCGAATATATGTGTAGAAACTGCCGGAAATCGTCGTGGTATTCACTC 
CAGAGCGATGAAAACGTTTCAGTTTGCTCATGGAAAACGGTGTAACAAGGGTGAACACTA 
TCCCATATCACCAGCTCACCGTCTTTCATTGCCATACGGAATTCCGGATGAGCATTCATC 
AGGCGGGCAAGAATGTGAATAAAGGCCGGATAAAACTTGTGCTTATTTTTCTTTACGGTC 
TTTAAAAAGGCCGTAATATCCAGCTGAACGGTCTGGTTATAGGTACATTGAGCAACTGAC 
TGAAATGCCTCAAAATGTTCTTTACGATGCCATTGGGATATATCAACGGTGGTATATCCA 
GTGATTTTTTTCTCCATTTTAGCTTCCTTAGCTCCTGAAAATCTCGATAACTCAAAAAAT 
ACGCCCGGTAGTGATCTTATTTCATTATGGTGAAAGTTGGAACCTCTTACGTGCCGATCA 
ACGTCTCATTTTCGCCAAAAGTTGGCCCAGGGCTTCCCGGTATCAACAGGGACACCAGGA 
TTTATTTATTCTGCGAAGTGATCTTCCGTCACAGGTATTTATTCGGCGCAAAGTGCGTCG 
GGTGATGCTGCCAACTTAGTCGACTACAGGTCACTAATACCATCTAAGTAGTTGATTCAT 
AGTGACTGGATATGTTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTAA 
TTTAATATATTGATATTTATATCATTTTACGTTTCTCGTTCAGCTTTCTTGTACAAAGTT 
GGCATTATAAGAAAGCATTGCTTATCAATTTGTTGCAACGAACAGGTCACTATCAGTCAA 
AATAAAATCATTATTTGCCATCCAGCTGCAGCTCTGGCCCGTGTCTCAAAATCTCTGATG 
TTACATTGCACAAGATAAAAATATATCATCATGAATTCTCATGTTTGACAGCTTATCATC 
GATAAGCTTTAATGCGGTAGTTTATCACAGTTAAATTGCTAACGCAGTCAGGCACCGTGT
ATGAAATCTAACAATGCGCTCATCGTCATCCTCGGCACCGTCACCCTGGATGCTGTAGGC 
ATAGGCTTGGTTATGCCGGTACTGCCGGGCCTCTTGCGGGATATCGTCCATTCCGACAGC 
ATCGCCAGTCACTATGGCGTGCTGCTAGCGCTATATGCGTTGATGCAATTTCTATGCGCA 
CCCGTTCTCGGAGCACTGTCCGACCGCTTTGGCCGCCGCCCAGTCCTGCTCGCTTCGCTA 
CTTGGAGCCACTATCGACTACGCGATCATGGCGACCACACCCGTCCTGTGGATCCTCTAC 
GCCGGACGCATCGTGGCCGGCATCACCGGCGCCACAGGTGCGGTTGCTGGCGCCTATATC 
GCCGACATCACCGATGGGGAAGATCGGGCTCGCCACTTCGGGCTCATGAGCGCTTGTTTC 
GGCGTGGGTATGGTGGCAGGCCCCGTGGCCGGGGGACTGTTGGGCGCCATCTCCTTGCAT 
GCACCATTCCTTGCGGCGGCGGTGCTCAACGGCCTCAACCTACTACTGGGCTGCTTCCTA 
ATGCAGGAGTCGCATAAGGGAGAGCGTCGACCGATGCCCTTGAGAGCCTTCAACCCAGTC 
AGCTCCTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTT 
ATCATGCAACTCGTAGGACAGGTGCCGGCAGCGCTCTGGGTCATTTTCGGCGAGGACCGC 
TTTCGCTGGAGCGCGACGATGATCGGCCTGTCGCTTGCGGTATTCGGAATCTTGCACGCC 
CTCGCTCAAGCCTTCGTCACTGGTCCCGCCACCAAACGTTTCGGCGAGAAGCAGGCCATT 
ATCGCCGGCATGGCGGCCGACGCGCTGGGCTACGTCTTGCTGGCGTTCGCGACGCGAGGC 
TGGATGGCCTTCCCCATTATGATTCTTCTCGCTTCCGGCGGCATCGGGATGCCCGCGTTG 
CAGGCCATGCTGTCCAGGCAGGTAGATGACGACCATCAGGGACAGCTTCAAGGATCGCTC 
GCGGCTCTTACCAGCCTAACTTCGATCATTGGACCGCTGATCGTCACGGCGATTTATGCC 
GCCTCGGCGAGCACATGGAACGGGTTGGCATGGATTGTAGGCGCCGCCCTATACCTTGTC 
TGCCTCCCCGCGTTGCGTCGCGGTGCATGGAGCCGGGCCACCTCGACCTGAATGGAAGCC 
GGCGGCACCTCGCTAACGGATTCACCACTCCAAGAATTGGAGCCAATCAATTCTTGCGGA 
GAACTGTGAATGCGCT^AACCAACCCTTGGCAGAACATATCCATCGCATGACCAAAATCCC 
TTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTC 
TTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACC 
AGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTT 
CAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTT 
CAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGC 
TGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAA 
GGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGAC 
CTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGG 
GAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGA 
GCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACT 
TGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAA
CGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGC
GTTATCCCCTGATTCTGTGGATAACCGTATTACCGCTAGCCAGGAAGAGTTTGTAGAAAC 
GCAAAAAGGCCATCCGTCAGGATGGCCTTCTGCTTAGTTTGATGCCTGGCAGTTTATGGC 
GGGCGTCCTGCCCGCCACCCTCCGGGCCGTTGCTTCACAACGTTCAAATCCGCTCCCGGC 
GGATTTGTCCTACTCAGGAGAGCGTTCACCGACAAACAACAGATAAAACGAAAGGCCCAG 
TCTTCCGACTGAGCCTTTCGTTTTATTTGATGCCTGGCAGTTCCCTACTCTCGCGTTAAC 
GCTAGCATGGATCTCGGGCCCCAAATAATGATTTTATTTTGACTGATAGTGACCTGTTCG
TTGCAACAAATTGATGAGCAATGCTTTTTTATAATGCCAACTTTGTACAAAAAAGCTGAA 
CGAGAAACGTAAAATGATATAAATATCAATATATTAAATTAGATTTTGCATAAAAAACAG 
ACTACATAATACTGTAAAACACAACATATCCAGTCACTATGAATCAACTACTTAGATGGT
ATTAGTGACCTGTAGTCGACCGACAGCCTTCCAAATGTTCTTCGGGTGATGCTGCCAACT 
TAGTCGACCGACAGCCTTCCAAATGTTCTTCTCAAACGGAATCGTCGTATCCAGCCTACT 
CGCTATTGTCCTCAATGCCGTATTAAATCATAAAAAGAAATAAGAAAAAGAGGTGCGAGC 
CTCTTTTTTGTGTGACAAAATAAAAACATCTACCTATTCATATACGCTAGTGTCATAGTC 
CTGAAAATCATCTGCATCAAGAACAATTTCACAACTCTTATACTTTTCTCTTACAAGTCG 
TTCGGCTTCATCTGGATTTTCAGCCTCTATACTTACTAAACGTGATAAAGTTTCTGTAAT 
TTCTACTGTATCGACCTGCAGACTGGCTGTGTATAAGGGAGCCTGACATTTATATTCCCC 
AGAACATCAGGTTAATGGCGTTTTTGATGTCATTTTCGCGGTGGCTGAGATCAGCCACTT 
CTTCCCCGATAACGGAGACCGGCACACTGGCCATATCGGTGGTCATCATGCGCCAGCTTT 
CATCCCCGATATGCACCACCGGGTAAAGTTCACGGGAGACTTTATCTGACAGCAGACGTG 
CACTGGCCAGGGGGATCACCATCCGTCGCCCGGGCGTGTCAATAATATCACTCTGTACAT 
CCACAAACAGACGATAACGGCTCTCTCTTTTATAGGTGTAAACCTTAAACTGCATTTCAC 
CAGTCCCTGTTCTCGTCAGCAAAAGAGCCGTTCATTTCAATAAACCGGGCGACCTCAGCC 
ATCCCTTCCTGATTTTCCGCTTTCCAGCGTTCGGCACGCAGACGACGGGCTTCATTCTGC 
ATGGTTGTGCTTACCAGACCGGAGATATTGACATCATATATGCCTTGAGCAACTGATAGC 
TGTCGCTGTCAACTGTCACTGTAATACGCTGCTTCATAGCACACCTCTTTTTGACATACT 
TCGGGTATACATATCAGTATATATTCTTATACCGCAAAAATCAGCGCGCAAATACGCATA
CTGTTATCTGGCTTTTAGTAAGCCGGATCCACGCGATTACGCCCCGCCCTGCCACTCATC 
GCAGTACTGTTGTAATTCATTAAGCATTCTGCCGACATGGAAGCCATCACAGACGGCATG 
ATGAACCTGAATCGCCAGC
pDONR206 4415 bp



CGGCATTGAGGACAATAGCGAGTAGGCTGGATACGACGATTCCGTTTGAGAAGAACATTT 

GGAAGGCTGTCGGTCGACTACAGGTCACTAATACCATCTAAGTAGTTGAATCATAGTGAC 

TGGATATGTTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTAATTTAAT 

ATATTGATATTTATATCATTTTACGTTTCTCGTTCAGCTTTTTTGTACAAAGTTGGCATT 

ATAAAAAAGCATTGCTTATCAATTTGTTGCAACGAACAGGTCACTATCAGTCAAAATAAA 

ATCATTATTTGGGGCCCGAGATCCATGCTAGCGGTAATACGGTTATCCACAGAATCAGGG 

GATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAG 

GCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGA 

CGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCT 

GGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCC 

TTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCG 

GTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGC 

TGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCA 

CTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAG 

TTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCT 

CTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACC 

ACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGA 

TCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCA 

CGTTAAGGGATTTTGGTCATGNCGCCGTCCCGTCAAGTCAGCGTAATGCTCTGCCAGTGT 

TACAACCAATTAACCAATTCTGATTAGAAAAACTCATCGAGCATCAAATGAAACTGCAAT 

TTATTCATATCAGGATTATCAATACCATATTTTTGAAAAAGCCGTTTCTGTAATGAAGGA 

GAAAACTCACCGAGGCAGTTCCATAGGATGGCAAGATCCTGGTATCGGTCTGCGATTCCG 

ACTCGTCCAACATCAATACAACCTATTAGCCGAGGTCTTCCGATCTCCTGAAGCCAGGGC 

AGATCCGTGCACAGCACCTTGCCGTAGAAGAACAGCAAGGCCGCCAATGCCTGACGATGC 

GTGGAGACCGAAACCTTGCGCTCGTTCGCCAGCCAGGACAGAAATGCCTCGACTTCGCTG 

CTGCCCAAGGTTGCCGGGTGACGCACACCGTGGAAACGGATGAAGGCACGAACCCAGTTG 

ACATAAGCCTGTTCGGTTCGTAAACTGTAATGCAAGTAGCGTATGCGCTCACGCAACTGG 

TCCAGAACCTTGACCGAACGCAGCGGTGGTAACGGCGCAGTGGCGGTTTTCATGGCTTGT 

TATGACTGTTTTTTTGTACAGTCTATGCCTCGGGCATCCAAGCAGCAAGCGCGTTACGCC 

GTGGGTCGATGTTTGATGTTATGGAGCAGCAACGATGTTACGCAGCAGCAACGATGTTAC 

GCAGCAGGGCAGTCGCCCTAAAACAAAGTTAGGTGGCTCAAGTATGGGCATCATTCGCAC 

ATGTAGGCTCGGCCCTGACCAAGTCAAATCCATGCGGGCTGCTCTTGATCTTTTCGGTCG 

TGAGTTCGGAGACGTAGCCACCTACTCCCAACATCAGCCGGACTCCGATTACCTCGGGAA 

CTTGCTCCGTAGTAAGACATTCATCGCGCTTGCTGCCTTCGACCAAGAAGCGGTTGTTGG 

CGCTCTCGCGGCTTACGTTCTGCCCAGGTTTGAGCAGCCGCGTAGTGAGATCTATATCTA 

TGATCTCGCAGTCTCCGGCGAGCACCGGAGGCAGGGCATTGCCACCGCGCTCATCAATCT 

CCTCAAGCATGAGGCCAACGCGCTTGGTGCTTATGTGATCTACGTGCAAGCAGATTACGG 

TGACGATCCCGCAGTGGCTCTCTATACAAAGTTGGGCATACGGGAAGAAGTGATGCACTT 

TGATATCGACCCAAGTACCGCCACCTAACAATTCGTTCAAGCCGAGATCGGCTTCCCGGC 

CTAATTTCCCCTCGTCAAAAATAAGGTTATCAAGTGAGAAATCACCATGAGTGACGACTG 

AATCCGGTGAGAATGGCAAAAGCGTATGCATTTCTTTCCAGACTTGTTCAACAGGCCAGC 

CATTACGCTCGTCATCAAAATCACTCGCATCAACCAAACCGTTATTCATTCGTGATTGCG 

CCTGAGCGAGACGAAATACGCGATCGCTGTTAAAAGGACAATTACAAACAGGAATCGAAT 

GCAACCGGCGCAGGAACACTGCCAGCGCATCAACAATATTTTCACCTGAATCAGGATATT 

CTTCTAATACCTGGAATGCTGTTTTCCCGCGGATCGCAGTGGTGAGTAACCATGCATCAT 

CAGGAGTACGGATAAAATGCTTGATGGTCGGAAGAGGCATAAATTCCGTCAGCCAGTTTA 

GTCTGACCATCTCATCTGTAACATCATTGGCAACGCTACCTTTGCCATGTTTCAGAAACA 

ACTCTGGCGCATCGGGCTTCCCATACAATCGAAAGATTGTCGCACCTGATTGCCCGACAT 

TATCGCGAGCCCATTTATACCCATATAAATCAGCATCCATGTTGGAATTTAATCGCGGCC 

TCCAGCAAGACGTTTCCCGTTGAATATGGCTCATAACACCCCTTGTATTACTGTTTATGT 

AAGCAGACAGTTTTATTGTTCATGATGATATATTTTTATCTTGTGCAATGTAACATCAGA 

GATTTTGAGACACGGGCCCNGCGCACTGCAGCTGGATCGGCAAATAATGATTTTATTTTG 

ACTGATAGTGACCTGTTCGTTGCAACAAATTGATAAGCAATGCTTTTTTATAATGCCAAC



TTTGTACAAGAAAGCTGAACGAGAAACGTAAAATGATATAAATATCAATATATTAAATTA 
GATTTTGCATAAAAAACAGACTACATAATACTGTAAAACACAACATATCCAGTCACTATG
ATTCAACTACTTAGATGGTATTAGTGACCTGTAGTCGACTAAGTTGGCAGCATCACCCGA 
CGCACTTTGCGCCGAATAAATACCTGTGACGGAAGATCACTTCGCAGAATAAATAAATCC 
TGGTGTCCCTGTTGATACCGGGAAGCCCTGGGCCAACTTTTGGCGAAAATGAGACGTTGA 
TCGGCACGTAAGAGGTTCCAACTTTCACCATAATGAAATAAGATCACTACCGGGCGTATT

TTTTGAGTTATCGAGATTTTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAAAATCACTGG 
ATATACCACCGTTGATATATCCCAATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTC
AGTTGCTCAATGTACCTATAACCAGACCGTTCAGCTGGATATTACGGCCTTTTTAAAGAC 
CGTAAAGAAAAATAAGCACAAGTTTTATCCGGCCTTTATTCACATTCTTGCCCGCCTGAT 
GAATGCTCATCCGGAATTCCGTATGGCAATGAAAGACGGTGAGCTGGTGATATGGGATAG 
TGTTCACCCTTGTTACACCGTTTTCCATGAGCAAACTGAAACGTTTTCATCGCTCTGGAG 
TGAATACCACGACGATTTCCGGCAGTTTCTACACATATATTCGCAAGATGTGGCGTGTTA 
CGGTGAAAACCTGGCCTATTTCCCTAAAGGGTTTATTGAGAATATGTTTTTCGTCTCAGC 
CAATCCCTGGGTGAGTTTCACCAGTTTTGATTTAAACGTGGCCAATATGGACAACTTCTT 
CGCCCCCGTTTTCACCATGGGCAAATATTATACGCAAGGCGACAAGGTGCTGATGCCGCT 
GGCGATTCAGGTTCATCATGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGCTTAATGA 
ATTACAACAGTACTGCGATGAGTGGCAGGGCGGGGCGTAAACGCGTGGATCCGGCTTACT 
AAAAGCCAGATAACAGTATGCGTATTTGCGCGCTGATTTTTGCGGTATAAGAATATATAC 
TGATATGTATACCCGAAGTATGTCAAAAAGAGGTGTGCTATGAAGCAGCGTATTACAGTG 
ACAGTTGACAGCGACAGCTATCAGTTGCTCAAGGCATATATGATGTCAATATCTCCGGTC

TGGTAAGCACAACCATGCAGAATGAAGCCCGTCGTCTGCGTGCCGAACGCTGGAAAGCGG 
AAAATCAGGAAGGGATGGCTGAGGTCGCCCGGTTTATTGAAATGAACGGCTCTTTTGCTG 
ACGAGAACAGGGACTGGTGAAATGCAGTTTAAGGTTTACACCTATAAAAGAGAGAGCCGT 
TATCGTCTGTTTGTGGATGTACAGAGTGATATTATTGACACGCCCGGGCGACGGATGGTG 
ATCCCCCTGGCCAGTGCACGTCTGCTGTCAGATAAAGTCTCCCGTGAACTTTACCCGGTG 
GTGCATATCGGGGATGAAAGCTGGCGCATGATGACCACCGATATGGCCAGTGTGCCGGTC 
TCCGTTATCGGGGAAGAAGTGGCTGATCTCAGCCACCGCGAAAATGACATCAAAAACGCC 
ATTAACCTGATGTTCTGGGGAATATAAATGTCAGGCTCCGTTATACACAGCCAGTCTGCA 
GGTCGATACAGTAGAAATTACAGAAACTTTATCACGTTTAGTAAGTATAGAGGCTGAAAA 
TCCAGATGAAGCCGAACGACTTGTAAGAGAAAAGTATAAGAGTTGTGAAATTGTTCTTGA 
TGCAGATGATTTTCAGGACTATGACACTAGCATATATGAATAGGTAGATGTTTTTATTTT
GTCACACAAAAAAGAGGCTCGCACCTCTTTTTCTTATTTCTTTTTATGATTTAATA 
[Figure: clone of CAT, subcloned; the caption, annotated sequence and plasmid map are illegible in the source. Restriction sites legible on the map include NdeI, StuI, SalI, SacI, Ecl136, SpeI, NotI, XmaIII and BamHI.]



WO 00/52027 



PCT/US00/05432 









[Figure: integration and excision of the lambda genome]

E. coli Genome: attB (25 bp)

Integration (Int, IHF)
Excision (Int, Xis, IHF)

E. coli Genome: attL, Lambda Genome, attR






Native Protein Expression:
Mlu I (reading frame A)
Bgl II (reading frame B)
Xba I (reading frame C)
Results of Cloning
tet and amp PCR Products
by Recombination

PCR Product                    Colonies Obtained    DNA Analyzed    Clones of
                               (100 µl plated)                      Predicted Size

tet                            6, 10                sc              0 of 8
attB-tet                       9, 6                 sc              1 of 8
attB+4G-tet                    824, 1064            sc              7 of 7
                                                    AvaI+Bam        7 of 7
amp                            7, 13                sc              0 of 8
attB-amp                       18, 22               sc              3 of 8
attB+4G-amp                    3020, 3540           sc              8 of 8
                                                    PstI            8 of 8
attB Plasmid (Pos. Control)    320, 394
Titration of attP amounts with various attB PCR amounts in BxP Reaction
Titration of 256 bp PCR in BxP Reaction
Titration of 1 kb PCR in BxP Reaction
Titration of 1.4 kb PCR in BxP Reaction
Titration of 3.4 kb PCR in BxP Reaction
Titration of 4.6 kb PCR in BxP Reaction
6.9 kb PCR DNA Titration in BxP Reaction
Titration of 10.1 kb PCR in BxP Reaction
Cloning of PCR Products of Different Sizes with the
GATEWAY™ PCR Cloning System

Size        fmols PCR    ng PCR    Cols/ml Transformation    Correct Clones/Total
            DNA          DNA       (pUC = 10^8 CFU/ml)       Examined**

0.26 kb*    15           3         1223                      10/10 (a)
            37.5         7.5       2815
1.0 kb      15           10        507                       49/50 (b)
            37.5         25        1447
1.4 kb      15           14        271                       48/50 (c)
            37.5         35        683
3.4 kb      15           34        478                       9/10 (a)
            37.5         85        976
4.6 kb      15           46        190                       10/10 (a)
            37.5         115       195
6.9 kb      15           69        30 (235)**                47/50 (b)
            37.5         173       54 (463)**

*The 0.26 kb PCR product was used unpurified; all the others were purified by precipitation with
PEG/MgCl2 as described in the text of Example 9, to remove primer dimers potentially present.
Standard incubations were for 60 min.

** overnight incubation

(a) DNA minipreps

(b) ampR/kanR

(c) tetR/kanR



Figure 77 
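
The fmol and ng columns of the table are related through the length of each PCR product. As a rough check, the conversion can be sketched in Python, assuming an average mass of about 660 g/mol per base pair of double-stranded DNA (a common approximation, not a value given in this document). The name `fmol_to_ng` is illustrative only.

```python
AVG_BP_MASS = 660.0  # g/mol per base pair of dsDNA (assumed approximation)


def fmol_to_ng(fmol, length_bp, bp_mass=AVG_BP_MASS):
    """Convert an amount of linear dsDNA from femtomoles to nanograms."""
    # fmol * 1e-15 mol  x  length_bp * bp_mass g/mol  x  1e9 ng/g
    return fmol * length_bp * bp_mass * 1e-6


if __name__ == "__main__":
    # Product sizes in base pairs, taken from the Size column of the table.
    for size_bp in (260, 1000, 1400, 3400, 4600, 6900):
        for fmol in (15, 37.5):
            ng = fmol_to_ng(fmol, size_bp)
            print(f"{size_bp / 1000:5.2f} kb  {fmol:5.1f} fmol  ~{ng:6.1f} ng")
```

Under this assumption the computed masses come out close to, though not always identical with, the tabulated ng values, which appear to have been rounded.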
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Reading frame A: 



EcoR V                                                                          EcoR V
1/2 site  attR1                                         attR2                   1/2 site

ATC    ACA AGT TTG TAC AAA AAA  ...-[CmR-ccdB]-...  TTC TTG TAC AAA GTG GTG     AT
TAG    TGT TCA AAC ATG TTT TTT  ...-[CmR-ccdB]-...  AAG AAC ATG TTT CAC CAC     TA



Reading frame B:

          attR1                                         attR2

A TCA  ACA AGT TTG TAC AAA AAA  ...-[CmR-ccdB]-...  TTC TTG TAC AAA GTG GTT GAT
T AGT  TGT TCA AAC ATG TTT TTT  ...-[CmR-ccdB]-...  AAG AAC ATG TTT CAC CAA CTA



Reading frame C:

          attR1                                         attR2

AT CAA  ACA AGT TTG TAC AAA AAA  ...-[CmR-ccdB]-...  TTC TTG TAC AAA GTG GTT CGA T
TA GTT  TGT TCA AAC ATG TTT TTT  ...-[CmR-ccdB]-...  AAG AAC ATG TTT CAC CAA GCT A
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Fusion protein 

codon           Reading frame A cassette

nnn nnn atc aca agt ttg tac aaa aaa gct
nnn nnn tag tgt tca aac atg ttt ttt cga

            attR1



Reading frame B cassette

nnn nnn nna tca aca agt ttg tac aaa aaa gct
nnn nnn nnt agt tgt tca aac atg ttt ttt cga

cannot be TG or TA



Reading frame C cassette

nnn nnn nat caa aca agt ttg tac aaa aaa gct
nnn nnn nta gtt tgt tca aac atg ttt ttt cga
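The note "cannot be TG or TA" in the reading frame B cassette can be checked mechanically: in frame B the vector supplies the first two bases of a codon whose third base is the first "a" of the cassette, so TG or TA would create the stop codons TGA or TAA. The Python sketch below enumerates the vector-supplied bases for each frame and reports those that introduce a stop codon in the fusion frame. The cassette start sequences are transcribed from the figure; the names `CASSETTE_START`, `PREFIX_LEN`, and `forbidden_prefixes` are hypothetical and introduced only for this illustration.

```python
from itertools import product

STOP_CODONS = {"TAA", "TAG", "TGA"}

# Top-strand start of each cassette as drawn in the figure (5' to 3').
CASSETTE_START = {
    "A": "ATCACAAGTTTGTACAAAAAAGCT",
    "B": "ATCAACAAGTTTGTACAAAAAAGCT",
    "C": "ATCAAACAAGTTTGTACAAAAAAGCT",
}

# Number of vector-supplied bases ("n") that complete the first codon
# spanning the vector/cassette junction in each reading frame.
PREFIX_LEN = {"A": 0, "B": 2, "C": 1}


def codons(seq: str) -> list[str]:
    """Split a sequence into complete codons, ignoring trailing bases."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def forbidden_prefixes(frame: str) -> list[str]:
    """Return the vector-supplied bases that create an in-frame stop codon."""
    bad = []
    for bases in product("ACGT", repeat=PREFIX_LEN[frame]):
        prefix = "".join(bases)
        if any(c in STOP_CODONS for c in codons(prefix + CASSETTE_START[frame])):
            bad.append(prefix)
    return bad


if __name__ == "__main__":
    for frame in "ABC":
        print(frame, forbidden_prefixes(frame))
```

Under these assumptions only frame B has a restriction, which matches the annotation in the figure.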
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prfC Parent III 4554 bp 



Location (Base Nos.)    Gene Encoded

410..286                attR1
660..1319               CmR
1439..1523              inactivated ccdA
1661..1966              ccdB
2007..2131              attR2
2753..3613              amp
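The feature coordinates above can be used to pull each element out of the sequence that follows. The Python sketch below assumes 1-based, inclusive coordinates, and assumes that a range written with the larger number first (410..286 for attR1) refers to the complementary strand; both conventions are assumptions of this illustration rather than statements from the figure. The names `FEATURES`, `extract`, and `feature_lengths` are hypothetical.

```python
# Feature table as (name, start, end), copied from the listing above.
FEATURES = [
    ("attR1", 410, 286),
    ("CmR", 660, 1319),
    ("inactivated ccdA", 1439, 1523),
    ("ccdB", 1661, 1966),
    ("attR2", 2007, 2131),
    ("amp", 2753, 3613),
]

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract(seq: str, start: int, end: int) -> str:
    """Extract a feature using 1-based inclusive coordinates.

    If start > end, the feature is assumed to lie on the complementary
    strand and the reverse complement is returned.
    """
    if start <= end:
        return seq[start - 1:end]
    return reverse_complement(seq[end - 1:start])


def feature_lengths(features=FEATURES) -> dict[str, int]:
    """Length in base pairs of each feature."""
    return {name: abs(end - start) + 1 for name, start, end in features}


if __name__ == "__main__":
    for name, length in feature_lengths().items():
        print(f"{name}: {length} bp")
```

With these coordinates the CmR, ccdB, and amp ranges each span a whole number of codons (660, 306, and 861 bp), and the two attR sites have the same length (125 bp).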



1 GCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA
61 CGACAGGTTT CCCGACTGGA AAGCGGGCAG TGAGCGCAAC GCAATTAATG TGAGTTAGCT
121 CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATGT TGTGTGGAAT
181 TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC CATGATTACG CCAAGCTTGC
241 ATGCCTGCAG GTCGACTCTA GAGGATCCCC GGGTACCGAT ATCAAACAAG TTTGTACAAA
301 AAAGCTGAAC GAGAAACGTA AAATGATATA AATATCAATA TATTAAATTA GATTTTGCAT
361 AAAAAACAGA CTACATAATA CTGTAAAACA CAACATATCC AGTCACTATG GCGGCCGCTA
421 AGTTGGCAGC ATCACCCGAC GCACTTTGCG CCGAATAAAT ACCTGTGACG GAAGATCACT
481 TCGCAGAATA AATAAATCCT GGTGTCCCTG TTGATACCGG GAAGCCCTGG GCCAACTTTT
541 GGCGAAAATG AGACGTTGAT CGGCACGTAA GAGGTTCCAA CTTTCACCAT AATGAAATAA
601 GATCACTACC GGGCGTATTT TTTGAGTTAT CGAGATTTTC AGGAGCTAAG GAAGCTAAAA
661 TGGAGAAAAA AATCACTGGA TATACCACCG TTGATATATC CCAATGGCAT CGTAAAGAAC
721 ATTTTGAGGC ATTTCAGTCA GTTGCTCAAT GTACCTATAA CCAGACCGTT CAGCTGGATA
781 TTACGGCCTT TTTAAAGACC GTAAAGAAAA ATAAGCACAA GTTTTATCCG GCCTTTATTC
841 ACATTCTTGC CCGCCTGATG AATGCTCATC CGGAATTCCG TATGGCAATG AAAGACGGTG
901 AGCTGGTGAT ATGGGATAGT GTTCACCCTT GTTACACCGT TTTCCATGAG CAAACTGAAA
961 CGTTTTCATC GCTCTGGAGT GAATACCACG ACGATTTCCG GCAGTTTCTA CACATATATT
1021 CGCAAGATGT GGCGTGTTAC GGTGAAAACC TGGCCTATTT CCCTAAAGGG TTTATTGAGA
1081 ATATGTTTTT CGTCTCAGCC AATCCCTGGG TGAGTTTCAC CAGTTTTGAT TTAAACGTGG
1141 CCAATATGGA CAACTTCTTC GCCCCCGTTT TCACCATGGG CAAATATTAT ACGCAAGGCG
1201 ACAAGGTGCT GATGCCGCTG GCGATTCAGG TTCATCATGC CGTCTGTGAT GGCTTCCATG
1261 TCGGCAGAAT GCTTAATGAA TTACAACAGT ACTGCGATGA GTGGCAGGGC GGGGCGTAAT
1321 CTAGAGGATC CGGCTTACTA AAAGCCAGAT AACAGTATGC GTATTTGCGC GCTGATTTTT
1381 GCGGTATAAG AATATATACT GATATGTATA CCCGAAGTAT GTCAAAAAGA GGTGTGCTAT
1441 GAAGCAGCGT ATTACAGTGA CAGTTGACAG CGACAGCTAT CAGTTGCTCA AGGCATATAT
1501 GATGTCAATA TCTCCGGTCT GGTAAGCACA ACCATGCAGA ATGAAGCCCG TCGTCTGCGT
1561 GCCGAACGCT GGAAAGCGGA AAATCAGGAA GGGATGGCTG AGGTCGCCCG GTTTATTGAA
1621 ATGAACGGCT CTTTTGCTGA CGAGAACAGG GACTGGTGAA ATGCAGTTTA AGGTTTACAC
1681 CTATAAAAGA GAGAGCCGTT ATCGTCTGTT TGTGGATGTA CAGAGTGATA TTATTGACAC
1741 GCCCGGGCGA CGGATGGTGA TCCCCCTGGC CAGTGCACGT CTGCTGTCAG ATAAAGTCTC
1801 CCGTGAACTT TACCCGGTGG TGCATATCGG GGATGAAAGC TGGCGCATGA TGACCACCGA
1861 TATGGCCAGT GTGCCGGTCT CCGTTATCGG GGAAGAAGTG GCTGATCTCA GCCACCGCGA
1921 AAATGACATC AAAAACGCCA TTAACCTGAT GTTCTGGGGA ATATAAATGT CAGGCTCCGT
1981 TATACACAGC CAGTCTGCAG GTCGACCATA GTGACTGGAT ATGTTGTGTT TTACAGTATT
2041 ATGTAGTCTG TTTTTTATGC AAAATCTAAT TTAATATATT GATATTTATA TCATTTTACG
2101 TTTCTCGTTC AGCTTTCTTG TACAAAGTGG TTCGATATCG GTACCGAGCT CGAATTCACT
2161 GGCCGTCGTT TTACAACGTC GTGACTGGGA AAACCCTGGC GTTACCCAAC TTAATCGCCT
2221 TGCAGCACAT CCCCCTTTCG CCAGCTGGCG TAATAGCGAA GAGGCCCGCA CCGATCGCCC
2281 TTCCCAACAG TTGCGCAGCC TGAATGGCGA ATGGCGCCTG ATGCGGTATT TTCTCCTTAC
2341 GCATCTGTGC GGTATTTCAC ACCGCATATG GTGCACTCTC AGTACAATCT GCTCTGATGC
2401 CGCATAGTTA AGCCAGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT GACGGGCTTG
2461 TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT GCATGTGTCA
2521 GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGACGAAAG GGCCTCGTGA TACGCCTATT
2581 TTTATAGGTT AATGTCATGA TAATAATGGT TTCTTAGACG TCAGGTGGCA CTTTTCGGGG
2641 AAATGTGCGC GGAACCCCTA TTTGTTTATT TTTCTAAATA CATTCAAATA TGTATCCGCT
2701 CATGAGACAA TAACCCTGAT AAATGCTTCA ATAATATTGA AAAAGGAAGA GTATGAGTAT
2761 TCAACATTTC CGTGTCGCCC TTATTCCCTT TTTTGCGGCA TTTTGCCTTC CTGTTTTTGC
2821 TCACCCAGAA ACGCTGGTGA AAGTAAAAGA TGCTGAAGAT CAGTTGGGTG CACGAGTGGG
2881 TTACATCGAA CTGGATCTCA ACAGCGGTAA GATCCTTGAG AGTTTTCGCC CCGAAGAACG
2941 TTTTCCAATG ATGAGCACTT TTAAAGTTCT GCTATGTGGC GCGGTATTAT CCCGTATTGA
3001 CGCCGGGCAA GAGCAACTCG GTCGCCGCAT ACACTATTCT CAGAATGACT TGGTTGAGTA
3061 CTCACCAGTC ACAGAAAAGC ATCTTACGGA TGGCATGACA GTAAGAGAAT TATGCAGTGC
3121 TGCCATAACC ATGAGTGATA ACACTGCGGC CAACTTACTT CTGACAACGA TCGGAGGACC
3181 GAAGGAGCTA ACCGCTTTTT TGCACAACAT GGGGGATCAT GTAACTCGCC TTGATCGTTG
3241 GGAACCGGAG CTGAATGAAG CCATACCAAA CGACGAGCGT GACACCACGA TGCCTGTAGC
3301 AATGGCAACA ACGTTGCGCA AACTATTAAC TGGCGAACTA CTTACTCTAG CTTCCCGGCA
3361 ACAATTAATA GACTGGATGG AGGCGGATAA AGTTGCAGGA CCACTTCTGC GCTCGGCCCT
3421 TCCGGCTGGC TGGTTTATTG CTGATAAATC TGGAGCCGGT GAGCGTGGGT CTCGCGGTAT
3481 CATTGCAGCA CTGGGGCCAG ATGGTAAGCC CTCCCGTATC GTAGTTATCT ACACGACGGG
3541 GAGTCAGGCA ACTATGGATG AACGAAATAG ACAGATCGCT GAGATAGGTG CCTCACTGAT
3601 TAAGCATTGG TAACTGTCAG ACCAAGTTTA CTCATATATA CTTTAGATTG ATTTAAAACT
3661 TCATTTTTAA TTTAAAAGGA TCTAGGTGAA GATCCTTTTT GATAATCTCA TGACCAAAAT
3721 CCCTTAACGT GAGTTTTCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC
3781 TTCTTGAGAT CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT
3841 ACCAGCGGTG GTTTGTTTGC CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG
3901 CTTCAGCAGA GCGCAGATAC CAAATACTGT CCTTCTAGTG TAGCCGTAGT TAGGCCACCA
3961 CTTCAAGAAC TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC
4021 TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA
4081 TAAGGCGCAG CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC
4141 GACCTACACC GAACTGAGAT ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA
4201 AGGGAGAAAG GCGGACAGGT ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG
4261 GGAGCTTCCA GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG
4321 ACTTGAGCGT CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG
4381 CAACGCGGCC TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC
4441 TGCGTTATCC CCTGATTCTG TGGATAACCG TATTACCGCC TTTGAGTGAG CTGATACCGC
4501 TCGCCGCAGC CGAACGACCG AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGA

pDEST28 7141 bp

ATGCATGTCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCC 

CGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCAT 

TGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTAT 

CATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTAT 

GCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC 

GCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGAC 

TCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAA 

AATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGT 

AGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCTCCCTATCAGTGATAGAGATCTC 

CCTATCAGTGATAGAGATCGTCGACGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGAGA 

CGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGGACT 

CTAGAGGATCCCTACCGGTGATATCCTCGAGCCCATCAACAAGTTTGTACAAAAAAGCTG 

AACGAGAAACGTAAAATGATATAAATATCAATATATTAAATTAGATTTTGCATAAAAAAC 

AGACTACATAATACTGTAAAACACAACATATCCAGTCACTATGGCGGCCGCATTAGGCAC 

CCCAGGCTTTACACTTTATGCTTCCGGCTCGTATAATGTGTGGATTTTGAGTTAGGATCC 

GGCGAGATTTTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAAAATCACTGGATATACCAC 

CGTTGATATATCCCAATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTCAGTTGCTCA 

ATGTACCTATAACCAGACCGTTCAGCTGGATATTACGGCCTTTTTAAAGACCGTAAAGAA 

AAATAAGCACAAGTTTTATCCGGCCTTTATTCACATTCTTGCCCGCCTGATGAATGCTCA 

TCCGGAATTCCGTATGGCAATGAAAGACGGTGAGCTGGTGATATGGGATAGTGTTCACCC 

TTGTTACACCGTTTTCCATGAGCAAACTGAAACGTTTTCATCGCTCTGGAGTGAATACCA 

CGACGATTTCCGGCAGTTTCTACACATATATTCGCAAGATGTGGCGTGTTACGGTGAAAA 

CCTGGCCTATTTCCCTAAAGGGTTTATTGAGAATATGTTTTTCGTCTCAGCCAATCCCTG 

GGTGAGTTTCACCAGTTTTGATTTAAACGTGGCCAATATGGACAACTTCTTCGCCCCCGT 

TTTCACCATGGGCAAATATTATACGCAAGGCGACAAGGTGCTGATGCCGCTGGCGATTCA 

GGTTCATCATGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGCTTAATGAATTACAACA 

GTACTGCGATGAGTGGCAGGGCGGGGCGTAAAGATCTGGATCCGGCTTACTAAAAGCCAG 

ATAACAGTATGCGTATTTGCGCGCTGATTTTTGCGGTATAAGAATATATACTGATATGTA 

TACCCGAAGTATGTCAAAAAGAGGTGTGCTATGAAGCAGCGTATTACAGTGACAGTTGAC

AGCGACAGCTATCAGTTGCTCAAGGCATATATGATGTCAATATCTCCGGTCTGGTAAGCA 

CAACCATGCAGAATGAAGCCCGTCGTCTGCGTGCCGAACGCTGGAAAGCGGAAAATCAGG 

AAGGGATGGCTGAGGTCGCCCGGTTTATTGAAATGAACGGCTCTTTTGCTGACGAGAACA 

GGGACTGGTGAAATGCAGTTTAAGGTTTACACCTATAAAAGAGAGAGCCGTTATCGTCTG

TTTGTGGATGTACAGAGTGATATTATTGACACGCCCGGGCGACGGATGGTGATCCCCCTG 

GCCAGTGCACGTCTGCTGTCAGATAAAGTCTCCGGTGAACTTTACCCGGTGGTGCATATC 

GGGGATGAAAGCTGGCGCATGATGACCACCGATATGGCCAGTGTGCCGGTCTCCGTTATC 

GGGGAAGAAGTGGCTGATCTCAGCCACCGCGAAAATGACATCAAAAACGCCATTAACCTG 

ATGTTCTGGGGAATATAAATGTCAGGCTCCCTTATACACAGCCAGTCTGCAGGTCGACCA 

TAGTGACTGGATATGTTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTA 

ATTTAATATATTGATATTTATATCATTTTACGTTTCTCGTTCAGCTTTCTTGTACAAAGT

GGTTGATGGGCGGCCGCTCTAGAGGGCCCAAGCTTACGCGTGCATGCGACGTCATAGCTC 

TCTCCCTATAGTGAGTCGTATTATAAGCTAGGCACTGGCCGTCGTTTTACAACGTCGTGA 

CTGGGAAAACTGCTAGCTTGGGATCTTTGTGAAGGAACCTTACTTCTGTGGTGTGACATA 

ATTGGACAAACTACCTACAGAGATTTAAAGCTCTAAGGTAAATATAAAATTTTTAAGTGT 

ATAATGTGTTAAACTAGCTGCATATGCTTGCTGCTTGAGAGTTTTGCTTACTGAGTATGA 

TTTATGAAAATATTATACACAGGAGCTAGTGATTCTAATTGTTTGTGTATTTTAGATTCA 

CAGTCCCAAGGCTCATTTCAGGCCCCTCAGTCCTCACAGTCTGTTCATGATCATAATCAG 

CCATACCACATTTGTAGAGGTTTTACTTGCTTTAAAAAACCTCCCACACCTCCCCCTGAA 

CCTGAAACATAAAATGAATGCAATTGTTGTTGTTAACTTGTTTATTGCAGCTTATAATGG 

TTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCATTTTTTTCACTGCATTC 

TAGTTGTGGTTTGTCCAAACTCATCAATGTATCTTATCATGTCTGGATCGATCCTGCATT

AATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGCTGGCGTAATAGCGAAG 

AGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGGACGCGC 

CCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACAC 

TTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCG 

CCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTT- 
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TACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGC 

CCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCT 

TGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGA 

TTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAATATTTAACGCGA 

ATTTTAACAAAATATTAACGTTTACAATTTCGCCTGATGCGGTATTTTCTCCTTACGCAT 

CTGTGCGGTATTTCACACCGCATACGCGGATCTGCGCAGCACCATGGCCTGAAATAACCT 

CTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAACCAGCTGTGGAATGTGT 

GTCAGTTAGGGTGTGGAAAGTCCCCAGGGTCCCCAGCAGGCAGAAGTATGCAAAGCATGC 

ATCTCAATTAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTA 

TGCAAAGCATGCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGCCCATCC

CGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGACTAATTTTTTTTA 

TTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCT 

TTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTTGATTCTTCTGACACAACAGTCTCGAACT 

TAAGACCATGGCCAAGCCTTTGTCTCAAGAAGAATCCACCCTCATTGAAAGAGCAACGGC 

TACAATCAACAGCATCCCCATCTCTGAAGACTACAGCGTCGCCAGCGCAGCTCTCTCTAG 

CGACGGCCGCATCTTCACTGGTGTCAATGTATATCATTTTACTGGGGGACCTTGTGCAGA 

ACTCGTGGTGCTGGGCACTGCTGCTGCTGCGGCAGCTGGCAACCTGACTTGTATCGTCGC 

GATCGGAAATGAGAACAGGGGCATCTTGAGCCCCTGCGGACGGTGCCGACAGGTGCTTCT 

CGATCTGCATCCTGGGATCAAAGCCATAGTGAAGGACAGTGATGGACAGCCGACGGCAGT 

TGGGATTCGTGAATTGCTGCCCTCTGGTTATGTGTGGGAGGGCTAAGCACTTCGTGGCCG 

AGTTCGAAATGACCGACCAAGCGACGCCCAACCTGCCATCACGATGGCCGCAATAAAATA 

TCTTTATTTTCATTACATCTGTGTGTTGGTTTTTTGTGTGAATCGATAGCGATAAGGATC 

CGCGTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGA 

CACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTAC 

AGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCG 

AAACGCGCGAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATA 

ATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATT 

TGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAA 

ATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTT

ATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAA 

GTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAAC 

AGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTT 

AAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGT 

CGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCAT 

CTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAAC 

ACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTG 

CACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCC 

ATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAA 

CTATTAACTGGCGAACTACTTACTCTAGCTTCCGGGGAACAATTAATAGACTGGATGGAG 

GCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCT 

GATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGAT 

GGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAA 

CGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGAC 

CAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATC 

TAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTC 

CACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTG 

CGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGGGGTGGTTTGTTTGCCG 

GATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCA 

AATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCG 

CCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCG 

TGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGA 

ACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATAC 

CTACAGCGTGAGCATTGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTAT 

CCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCC

TGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGA 

TGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTC 

CTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTG 

GATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAG-

pDEST29 7156 bp

ATGCATGTCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCC 

CGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCAT 

TGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCAGTTGGCAGTACATCAAGTGTAT 

CATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTAT 

GCCCAGTACATGACGTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC 

GCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGAC 

TCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAA 

AATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGT 

AGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCTCCCTATCAGTGATAGAGATCTC

CCTATCAGTGATAGAGATCGTCGACGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGAGA 

CGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGGACC 

ATGGCGTACTACCATCACCATCACCATCACACCGGTGATATCCTCGAGCGCATCACAAGT

TTGTACAAAAAAGCTGAACGAGAAACGTAAAATGATATAAATATCAATATATTAAATTAG 

ATTTTGCATAAAAAACAGACTACATAATACTGTAAAACACAACATATCCAGTCACTATGG 

CGGCCGCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATAATGTGTGGA 

TTTTGAGTTAGGATCCGGCGAGATTTTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAAAA 

TCACTGGATATACCACCGTTGATATATCCCAATGGCATCGTAAAGAACATTTTGAGGCAT 

TTCAGTCAGTTGCTCAATGTACCTATAACCAGACCGTTCAGCTGGATATTACGGCCTTTT 

TAAAGACCGTAAAGAAAAATAAGCACAAGTTTTATCCGGCCTTTATTCACATTCTTGCCC 

GCCTGATGAATGCTCATCCGGAATTCCGTATGGCAATGAAAGACGGTGAGCTGGTGATAT 

GGGATAGTGTTCACCCTTGTTACACCGTTTTCCATGAGCAAACTGAAACGTTTTCATCGC 

TCTGGAGTGAATACCACGACGATTTCCGGCAGTTTCTACACATATATTCGCAAGATGTGG 

CGTGTTACGGTGAAAACCTGGCCTATTTCCCTAAAGGGTTTATTGAGAATATGTTTTTCG 

TCTCAGCCAATCCCTGGGTGAGTTTCACCAGTTTTGATTTAAACGTGGCCAATATGGACA 

ACTTCTTCGCCCCCGTTTTCACCATGGGCAAATATTATACGCAAGGCGACAAGGTGCTGA 

TGCCGCTGGCGATTCAGGTTCATCATGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGC 

TTAATGAATTACAACAGTACTGCGATGAGTGGCAGGGCGGGGCGTAAACGCGTGGATCCG 

GCTTACTAAAAGCCAGATAACAGTATGCGTATTTGCGCGCTGATTTTTGCGGTATAAGAA 

TATATACTGATATGTATACCCGAAGTATGTCAAAAAGAGGTGTGCTATGAAGCAGCGTAT

TACAGTGACAGTTGACAGCGACAGCTATCAGTTGCTCAAGGCATATATGATGTCAATATC 

TCCGGTCTGGTAAGCACAACCATGCAGAATGAAGCCCGTCGTCTGCGTGCCGAACGCTGG 

AAAGCGGAAAATCAGGAAGGGATGGCTGAGGTCGCCCGGTTTATTGAAATGAACGGCTCT 

TTTGCTGACGAGAACAGGGACTGGTGAAATGCAGTTTAAGGTTTACACCTATAAAAGAGA 

GAGCCGTTATCGTCTGTTTGTGGATGTACAGAGTGATATTATTGACACGCCCGGGCGACG 

GATGGTGATCCCCCTGGCCAGTGCACGTCTGCTGTCAGATAAAGTCTCCCGTGAACTTTA 

CCCGGTGGTGCATATCGGGGATGAAAGCTGGCGCATGATGACCACCGATATGGCCAGTGT 

GCCGGTCTCCGTTATCGGGGAAGAAGTGGCTGATCTCAGCCACCGCGAAAATGACATCAA 

AAACGCCATTAACCTGATGTTCTGGGGAATATAAATGTCAGGCTCCGTTATACACAGCCA 

GTCTGCAGGTCGACCATAGTGACTGGATATGTTGTGTTTTACAGTATTATGTAGTCTGTT 

TTTTATGCAAAATCTAATTTAATATATTGATATTTATATCATTTTACGTTTCTCGTTCAG 

CTTTCTTGTACAAAGTGGTGATGGGCGGCCGCTCTAGAGGGCCCAAGCTTACGCGTGCAT 

GCGACGTCATAGCTCTCTCCCTATAGTGAGTCGTATTATAAGCTAGGCACTGGCCGTCGT 

TTTACAACGTCGTGACTGGGAAAACTGCTAGCTTGGGATCTTTGTGAAGGAACCTTACTT 

CTGTGGTGTGACATAATTGGACAAACTACCTACAGAGATTTAAAGCTCTAAGGTAAATAT 

AAAATTTTTAAGTGTATAATGTGTTAAACTAGCTGCATATGCTTGCTGCTTGAGAGTTTT 

GCTTACTGAGTATGATTTATGAAAATATTATACACAGGAGCTAGTGATTCTAATTGTTTG 

TGTATTTTAGATTCACAGTCCCAAGGCTCATTTCAGGCCCCTCAGTCCTCACAGTCTGTT 

CATGATCATAATCAGCCATACCACATTTGTAGAGGTTTTACTTGCTTTAAAAAACCTCCC 

ACACCTCCCCCTGAACCTGAAACATAAAATGAATGCAATTGTTGTTGTTAACTTGTTTAT 

TGCAGCTTATAATGGTTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCATT 

TTTTTCACTGCATTCTAGTTGTGGTTTGTCCAAACTCATCAATGTATCTTATCATGTCTG 

GATCGATCCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGCT 

GGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATG 

GCGAATGGGACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCA 

GCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCT 

TTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGT- 
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TCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCAC 

GTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCT 

TTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTT 

TTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAAC 

AAATATTTAACGGGAATTTTAACAAAATATTAACGTTTACAATTTCGCCTGATGCGGTAT 

TTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCATACGCGGATCTGCGCAGCACCAT 

GGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAACC

AGCTGTGGAATGTGTGTCAGTTAGGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAA 

GTATGCAAAGCATGCATCTCAATTAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCCC 

CAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACCATAGTCCCGCCCC 

TAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCT 

GACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGA 

AGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTTGATTCTTCTGACA 

CAACAGTCTCGAACTTAAGACCATGGCCAAGCCTTTGTCTCAAGAAGAATCCACCCTCAT 

TGAAAGAGCAACGGCTACAATCAACAGCATCCCCATCTCTGAAGACTACAGCGTCGCCAG 

CGCAGCTCTCTCTAGCGACGGCCGCATCTTCACTGGTGTCAATGTATATCATTTTACTGG 

GGGACCTTGTGCAGAACTCGTGGTGCTGGGCACTGCTGCTGCTGCGGCAGCTGGCAACCT 

GACTTGTATCGTCGCGATCGGAAATGAGAACAGGGGCATCTTGAGCCCCTGCGGACGGTG 

CCGACAGGTGCTTCTCGATCTGCATCCTGGGATCAAAGCCATAGTGAAGGACAGTGATGG 

ACAGCCGACGGCAGTTGGGATTCGTGAATTGCTGCCCTCTGGTTATGTGTGGGAGGGCTA 

AGCACTTCGTGGCCGAGTTCGAAATGACCGACCAAGCGACGCCCAACCTGCCATCACGAT 

GGCCGCAATAAAATATCTTTATTTTCATTACATCTGTGTGTTGGTTTTTTGTGTGAATCG 

ATAGCGATAAGGATCCGCGTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAG

TTAAGCCAGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTC 

CCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTT 

TCACCGTCATCACCGAAACGCGCGAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAG 

GTTAATGTCATGATAATAATGGTTTCTTAGACGTCAGGTGGCAGTTTTCGGGGAAATGTG 

CGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGA 

CAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACAT 

TTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCA 

GAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATC 

GAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCA 

ATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGG 

CAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCA 

GTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATA 

ACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAG 

CTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCG 

GAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCA 

ACAACGTTGCGCAAACTATTAACTGGCGAAGTACTTACTCTAGCTTCCCGGCAACAATTA 

ATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCT 

GGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCA 

GCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAG 

GCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCAT 

TGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTT 

TAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAA

CGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGA 

GATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCG 

GTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGC 

AGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAG 

AACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCC 

AGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCG 

CAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTAC 

ACCGAACTGAGATACCTACAGCGTGAGCATTGAGAAAGCGCCACGCTTCCCGAAGGGAGA 

AAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTT 

CCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAG 

CGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCG 

GCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTA 

TCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGC-

pDEST30 7544 bp

gSStaccatggtgatgcggttttggcagtacatcaatgggcgtggatagcggtttgac 

TCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAA 

aa^ca^cgggactttccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggt 

AGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCTCCCTATCAGTGATAGAGATCTC 
OTA?CAGTG^ 

rScATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGGACT 

SaSSatccct^ 

ScSagaaacgtaaaatgatataaatatcaatatattaaattagato^ 

a^actacItaatactgtaaaacacaacatatccagtcactatggcggccgcattaggcac 

?c?aggctttacactttatgcttccggctcgtataatgtgtggattttgagttaggatcc 

ggcgagattttcaggagctaaggaagctaaaatggagaaaaaaatcactggatataccac 

cgttga?atatcccaatggcatcgtaaagaacattttgaggcatttcagtcagttgctca 

Sotacctataaccagaccgttcagctggatattacggcctttttaaagaccgtaaagaa 

aaataagcacaagttttatccggcctttattcacattcttgcccgcctgatgaatgctca 

^ggaat?ccgtatggcaatgaaagacggtgagctggtgatatgggatagtgttcaccc 

ttottacaccgttttccatgagcaaactgaaacgttttcatcgctctggagtgaatacca 

cgacgatttccggcagtttctacacatatattcgcaagatgtggcgtgttacggtgaaaa 

cctSSctatttccctaaagggtttattgagaatatgtttttcgtctcagccaatcc 

gctgagStcaccagt^gatttaaacgtggccaatatggaca^ 
tttcaccatgggcaaatattatacgcaaggcgacaaggtgctgatgccgctggcgattca 

gg?5catcatgccgtctgtgatggcttccatgtcggcagaatgcttaa^ 

g?IctgcS?SagtggcLggcggggcgt^ 

ataacagtatgcgtatttgcgcgctgatttttgcggtataagaatatatactgatatgta 

tIc^cgaIgtatct^ 
Ig1£a^c?at^ 

caa?catgcagaatgaagcccgtcgtctgcgtgccgaacgctggaaagcggaaaatcagg 
aa^ggatggctg^ggtcgcccggtttattgaaatgaacggctcttttgctgacgagaac 
rrr actggtgaaatgcagtttaaggtttacacctataaaagagagagccgttatcgtctg 

???otgga?g^agagtgatattattgacacgcc^ 

gccagtgScgtctgctgtcagataaagtctcccgtgaactttacccggtggtgcata^ 

ggSgatga^gctggcgcatgatgaccaccgatatggccagtgtgccggtctccgttat^ 

ggggaagaagtggctgatctcagccaccgcgaaaatgacatcaaaaacgccattaacctg 

a?g?^?g^ggaatataaatgtcaggctcccttatacacagccagtctgcagg 

tagtgactggatatgttgtgttttacagtattatgtagtctgttttttatgcaaaatcta 

S?taatatattgatatttatatcattttacgtttctcgttcagctttcttgtacaaa^ 

ggttgatgggcggccgctctagagggcccaagcttacgcgtgcatgcgacgtcatagctc 

?SccctatagtgS 

ctgggaaaactgctagcttgggatctttgtgaaggaaccttacttctgtggtgtgacata 
S?ggacaaactacctacagagatttaaagctctaaggtaaatataaaattttta^^ 

a?aa?otg1ta^ 

??^tgaaaatattatacacaggagctagtgattctaattgtttgtgtattttagattca 

cagtccgaaggctcatttcaggcccctcagtcctcacagtctgttcatgatcataatcag 

ccataccacatttgtagaggttttacttgctttaaaaaacctcccacacctccccctgaa 

cctgaaacataaaatgaatgcaattgttgttgttaacttgtttattgcagcttataatgg 

ttacaaa^aaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattc 

?agttgtggtttgtccaaactcatcaatgtatcttatcatgtctggatcgatcctgcatt 

aatgaatcggccaacgcgcggggagaggcggtttgcgtattggctggcgtaatagcgaag 

aggcccgcaccgatcgcccttcccaacagttgcgcagcctgaatggcgaatgggacgcgc 

otgtagcSgcgcattaagcgcggcgggtgtggtggttacgcgcagcgtgaccgctacac 

?5gccagcg?cctagcgcccgctcctttcgctttcttcccttcctttctcgc 

ccggctttccccgtcaagctctaaatcgggggctccctttagggttccgatttagtgctt- 
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TACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGC 

CCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCT 

TGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGA 

TTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAATATTTAACGCGA 

ATTTTAACAAAATATTAACGTTTACAATTTCGCCTGATGCGGTATTTTCTCCTTACGCAT 

CTGTGCGGTATTTCACACCGCATACGCGGATCTGCGCAGCACCATGGCCTGAAATAACCT 

CTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAACCAGCTGTGGAATGTGT 

GTCAGTTAGGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGC 

ATCTCAATTAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTA 

TGCAAAGCATGCATCTCAATTAGTCAGCAAGCATAGTCCCGCCCCTAACTCCGCCCATCC

CGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGACTAATTTTTTTTA 

TTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCT 

TTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTTGATTCTTCTGACACAACAGTCTCGAACT 

TAAGGCTAGAGCCACCATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTG 

GGTGGAGAGGCTATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGC 

CGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCCGG 

TGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGT 

TCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACTGGCTGCTATTGGG 

CGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCCTGCCGAGAAAGTATCCAT 

CATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTACCTGCCCATTCGACCA 

CCAAGCGAAACATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCA 

GGATGATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAA 

GGCGCGCATGCCCGACGGCGAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAA 

TATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGC 

GGACCGCTATCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGA 

ATGGGCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGC 

CTTCTATCGCCTTCTTGACGAGTTCTTCTGAGCGGGACTCTGGGGTTCGAAATGACCGAC 

CAAGCGACGCCCAACCTGCCATCACGATGGCCGCAATAAAATATCTTTATTTTCATTACA 

TCTGTGTGTTGGTTTTTTGTGTGAATCGATAGCGATAAGGATCCGCGTATGGTGCACTCT 

CAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGC 

TGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGT 

CTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAACGCGCGAGACGAAA 

GGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATGGTTTCTTAGAC 

GTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAAT 

ACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTG 

AAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGC 

ATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGA

TCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGA 

GAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGG 

CGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTC 

TCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGAC 

AGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACT 

TCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCA

TGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCG 

TGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACT 

ACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGG 

ACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGG 

TGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTAT 

CGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGC 

TGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATAT 

ACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTT 

TGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCC

CGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTT 

GCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAAC

TCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGT 

GTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCT 

GCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGA 

CTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCAC- 
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ACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCATTG 
AGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGT 
CGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCC 
TGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCG 
GAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCC 
TTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGC 
CTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAG 
CGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCA 
TTAATGCAGAGCTTGCAATTCGCGCGTTTTTCAATATTATTGAAGCATTTATCAGGGTTA 
TTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCC 
GCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTATCATGACATT 
AACCTATAAAAATAGGCGTAGTACGAGGCCCTTTCACTCATTAG 
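Because each listing in these figures is headed by a declared length (for example, "prfC Parent III 4554 bp"), a transcribed copy can be sanity-checked by stripping position labels and stray whitespace and counting the remaining bases. The Python sketch below does this for a numbered listing and also compares each position label with the running base count. The function name `parse_listing` and the `SAMPLE` constant are hypothetical; the sample uses the first two lines of the prfC Parent III listing.

```python
import re

# First two lines of the prfC Parent III listing, used as sample input.
SAMPLE = """\
1 GCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA
61 CGACAGGTTT CCCGACTGGA AAGCGGGCAG TGAGCGCAAC GCAATTAATG TGAGTTAGCT
"""


def parse_listing(text: str) -> tuple[str, list[str]]:
    """Return (sequence, warnings) for a sequence listing.

    Leading all-digit tokens on a line are joined into a position label,
    so a label split during extraction (such as "3 01") is read as 301.
    Characters other than A, C, G, T, or N are dropped and reported.
    """
    parts: list[str] = []
    warnings: list[str] = []
    total = 0
    for lineno, line in enumerate(text.splitlines(), start=1):
        tokens = line.split()
        if not tokens:
            continue
        label = ""
        while tokens and tokens[0].isdigit():
            label += tokens.pop(0)
        raw = "".join(tokens).upper()
        bases = re.sub(r"[^ACGTN]", "", raw)
        if len(bases) != len(raw):
            dropped = len(raw) - len(bases)
            warnings.append(f"line {lineno}: dropped {dropped} non-base character(s)")
        if label and int(label) != total + 1:
            warnings.append(f"line {lineno}: label {label}, expected {total + 1}")
        parts.append(bases)
        total += len(bases)
    return "".join(parts), warnings


if __name__ == "__main__":
    sequence, problems = parse_listing(SAMPLE)
    print(len(sequence), "bases;", len(problems), "warning(s)")
```

Comparing `len(sequence)` with the length given in the listing header is a quick way to find lines where characters were lost or added during extraction.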
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CMV promoter 




f1 intergenic region 
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pDEST31 7559 bp 

CA^ATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTAT 

gcSIotacatgaccttatgggactttcctacttggcagtacatctacgtatta 
gSStaccatggtgatgcggttttggcagtacatcaatgggcgtggatagcggtttgac 

TrArrrrGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAA 
AATCAACG^ 

AGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCTCCCTATCAGTGATAGAGATCTC

cctatSgtoma^atcgtcgacgagctcgtttagtgaaccgtcagatcgcctggaqa 

C§?CATCCA?GC?GTTTTGACCTCCA^ 

ATGGCGTACTACCATCACCATCACCATCACACCGGTGATATCCTCGAGCCCATCACAAGT 

TTGTACA^AAAAGCTGAACGAGAAACGTAAAATGATATAAATATCAATATA 

ATTTTGCATAAAAAACAGACTACATAATACTGTAAAACACAACATATCCAGTCACTATGG 

CGGCCGCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATAATGTGTGGA 

TTTTGAGTTAGGATCCGGCGAGATTTTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAA^ 

TCACTGGATATACCACCGTTGATATATCCCAATGGCATCGTAAAGAACATTTTGAGGCAT 

TTCAGTCAGTTGCTCAATGTACCTATAACCAGACCGTTCAGCTGGATATTACGGCCTTTT 

?aSgaccgtaaagaaaaataagcacaagtttt^^ 

GCCTGATGAATGCTCATCCGGAATTCCGTATGGCAATGAAAGACGGTGAGCTGGTGATAT 
GGGATAGTGTTCACCCTTGTTACACCGTTTTCCATGAGCAAACTGAAACGTTTTCATCGC 
TCTGG^GTGAATACCACGACGATTTCCGGCAGTTTCTACACATATATTCGCAAGATGTGG 
J^^^'™q?^ CCTGGCCTATTT CCCTAAAGGGTTTATTGAGAATATGTTTTTCG 

TCTCAGCCAATCC^T^GGTGAGTTTCACCAGTTTTGATTTA^ 

ACTTCTTCGCCCCCGTTTTCACCATGGGCAAATATTATACGCAAGGCGACAAGGTGCTGA 
TGCCGCTGGCGATTCAGGTTCATCATGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGC 

Saat^aatScaacS^ 

GCTTACTAAAAGCCAGATAACAGTATGCGTATTTGCGCGCTGATTTTTGCGGTATAAGAA 
TA^A^ACTGATATGTATACCCGAAGTATGTCAAAAAGAGGTGTGCTATGAAGCAGCGTAT 

?aSgtgIcagttgacagcgacagctatca^ 

TCCGGTCTGGTAAGCACAACCATGCAGAATGAAGCCCGTCGTCTGCGTGCCGAACGCTGG 

aaXgcggaaaatcaggaagggatggctgaggtcgcccggtttattgaaatgaacggctct 

TTTGCTGACGAGAACAGGGACTGGTGAAATGCAGTTTAAGGTTTACACCTATAAAAGAGA 

gag?cg??atcgtctgtttgtg 
Sa?gg?Stcc2cctggccagtgcacgtctgct 

?ccgg?gotScSatcgS 

gccggtctccgttatcggggaagaagtggctgatctcagccaccgcgaaaatgacatc^ 
aaacgccattaacctgatgttctggggaatataaatgtcaggctccgttatacacagcca 
g1^?gcaggtcgaccatagtgactggatatgttgtgttttacagtattatgtagtctgt 

ctttcttgtacaaagtggtgatgggcggccgctctagagggcccaagcttacgcgtgcat 

SgaS?Stag^?tc?ccctatagtgagtcgtatta^ 

?ttacaacgtcgtgactgggaaaactgctagcttgggatctttgtg^ 

ctgtgotgtgacataattggacaaactacctacagagatttaaagctctaaggtaaa 

aIaatttttaagtgtataatgtgttaaactagctgcatatgcttgctgcttgagagtttt 

g^toctgagtatgatttatgaaaatattatacacaggagctagtgattct^ 

?ctat5??^ga?tcacagtcccaaggctcatttcaggc 
Stca?cataatcagccataccacatttgtagaggttttacttgctttaa^ 
a^ac^tcccc^tgaacctgaaacataaaatgaatgcaattgttgttgttaacttgtttat 
tgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatt 

??5ttcactgcattctagttgtggtttgtccaaactcatcaatgt^^ 

gatcgatcctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtattggct 

ggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcgcagcctgaatg 

g^aItcggacg^ccctgtagcggcgcattaagcgcggcgggtgtggtggttacgcgc 

gStcaccgctacacttgccagcgccctagcgcccgctcctttcgctttcttccc 

?tctcgccacgttcgccggctttccccgtcaagctctaaatcgggggc
TCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCAC

GTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCT 

TTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTT 

TTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAAC 

AAATATTTAACGGGAATTTTAACAAAATATTAACGTTTACAATTTCGCCTGATGCGGTAT 

TTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCATACGCGGATCTGCGCAGCACCAT 

GGCCTGAAATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAACC 

AGCTGTGGAATGTGTGTCAGTTAGGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAA 

GTATGCAAAGCATGCATCTCAATTAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCCC

CAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACCATAGTCCCGCCCC 

TAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCT 

GACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGA 

AGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTTGATTCTTCTGACA 

CAACAGTCTCGAACTTAAGGCTAGAGCCACCATGATTGAACAAGATGGATTGCACGCAGG 

TTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACTGGGCACAACAGACAATCGG 

CTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAA 

GACCGACCTGTCCGGTGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCT 

GGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGA 

CTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCCTGC 

CGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTAC 

CTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGTACTCGGATGGAAGC 

CGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGCCGAACT 

GTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGGATCTCGTCGTGACCCATGGCGA 

TGCCTGCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTGG 

CCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCGTTGGCTACCCGTGATATTGCTGA 

AGAGCTTGGCGGCGAATGGGCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGA 

TTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTCTTCTGAGCGGGACTCTGGGG 

TTCGAAATGACCGACCAAGCGACGCCCAACCTGCCATCACGATGGCCGCAATAAAATATC 

TTTATTTTCATTACATCTGTGTGTTGGTTTTTTGTGTGAATCGATAGCGATAAGGATCCG 

CGTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACA 

CCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAG 

ACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAA 

ACGCGCGAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAAT 

AATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTG 

TTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAAT 

GCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTAT 

TCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGT 

AAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAG 

CGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAA

AGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCG 

CCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCT 

TACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACAC 

TGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCA 

CAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCAT 

ACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACT 

ATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGC 

GGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGA 

TAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGG 

TAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACG 

AAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCA 

AGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTA 

GGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCA 

CTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCG 

CGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACGAGCGGTGGTTTGTTTGCCGGA 

TCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAA 

TACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCC 

TACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTG 

TCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAAC
GGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCT
ACAGCGTGAGCATTGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCC 
GGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTG 
GTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATG 
CTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCT 
GGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGA 
TAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCG 
CAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGC 
GCGTTGGCCGATTCATTAATGCAGAGCTTGCAATTCGCGCGTTTTTCAATATTATTGAAG 
CATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAA 
ACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCAT 
TATTATCATGACATTAACCTATAAAAATAGGCGTAGTACGAGGCCCTTTCACTCATTAG

ARS4/CEN6
pDEST32 12288 bp

GACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATGGTTT 

CTTAGGACGGATCGCTTGCCTGTAACTTACACGCGCCTCGTATCTTTTAATGATGGAATA 

ATTTGGGAATTTACTCTGTGTTTATTTATTTTTATGTTTTGTATTTGGATTTTAGAAAGT 

AAATAAAGAAGGTAGAAGAGTTACGGAATGAAGAAAAAAAAATAAACAAAGGTTTAAAAA 

ATTTCAACAAAAAGCGTACTTTACATATATATTTATTAGACAAGAAAAGCAGATTAAATA 

GATATACATTCGATTAACGATAAGTAAAATGTAAAATCACAGGATTTTCGTGTGTGGTCT 

TCTACACAGACAAGATGAAACAATTCGGCATTAATACCTGAGAGCAGGAAGAGCAAGATA 

AAAGGTAGTATTTGTTGGCGATCCCCCTAGAGTCTTTTACATCTTCGGAAAACAAAAACT 

ATTTTTTCTTTAATTTCTTTTTTTACTTTCTATTTTTAATTTATATATTTATATTAAAAA 

ATTTAAATTATAATTATTTTTATAGCACGTGATGAAAAGGACCCAGGTGGCACTTTTCGG 

GGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCG 

CTCATGAGACAATAACCCTGATAAATGCTTCAATAATCTGCAGTGCGCAGGGCCCGTGTC

TCAAAATCTCTGATGTTACATTGCACAAGATAAAAATATATCATCATGAACAATAAAACT

GTCTGCTTACATAAACAGTAATACAAGGGGTGTTATGAGCCATATTCAACGGGAAACGTC 

TTGCTGGAGGCCGCGATTAAATTCCAACATGGATGCTGATTTATATGGGTATAAATGGGC 

TCGGTAGCCAACCACTAGAACTATAGCTAGAGTCCTGGGCGAACAAACGATGCTCGCCTT 

CCAGAAAACCGAGGATGCGAACCACTTCATCCGGGGTCAGCACCACCGGCAAGCGCCGCG 

ACGGCCGAGGTCTTCCGATCTCCTGAAGCCAGGGCAGATCCGTGCACAGCACCTTGCCGT 

AGAAGAACAGCAAGGCCGCCAATGCCTGACGATGCGTGGAGACCGAAACCTTGCGCTCGT 

TCGCCAGCCAGGACAGAAATGCCTCGACTTCGCTGCTGCCCAAGGTTGCCGGGTGACGCA 

CACCGTGGAAACGGATGAAGGCACGAACCCAGTTGACATAAGCCTGTTCGGTTCGTAAAC 

TGTAATGCAAGTAGCGTATGCGCTCACGCAACTGGTCCAGAACCTTGACCGAACGCAGCG 

GTGGTAACGGCGCAGTGGCGGTTTTCATGGCTTGTTATGACTGTTTTTTTGTACAGTCTA 

TGCCTCGGGCATCCAAGCAGCAAGCGCGTTACGCCGTGGGTCGATGTTTGATGTTATGGA 

GCAGCAACGATGTTACGCAGCAGCAACGATGTTACGCAGCAGGGCAGTCGCCCTAAAACA 

AAGTTAGGTGGCTCAAGTATGGGCATCATTCGCACATGTAGGCTCGGCCCTGACCAAGTC 

AAATCCATGCGGGCTGCTCTTGATCTTTTCGGTCGTGAGTTCGGAGACGTAGCCACCTAC 

TCCCAACATCAGCCGGACTCCGATTACCTCGGGAACTTGCTCCGTAGTAAGACATTCATC 

GCGCTTGCTGCCTTCGACCAAGAAGCGGTTGTTGGCGCTCTCGCGGCTTACGTTCTGCCC 

AGGTTTGAGCAGCCGCGTAGTGAGATCTATATCTATGATCTCGCAGTCTCCGGCGAGCAC 

CGGAGGCAGGGCATTGCCACCGCGCTCATCAATCTCCTCAAGCATGAGGCCAACGCGCTT 

GGTGCTTATGTGATCTACGTGCAAGCAGATTACGGTGACGATCCCGCAGTGGCTCTCTAT 

ACAAAGTTGGGCATACGGGAAGAAGTGATGCACTTTGATATCGACCCAAGTACCGCCACC 

TAACAATTCGTTCAAGCCGAGATCGGCTTCCCGGCCTAATAGGTTGTATTGATGTTGGAC 

GAGTCGGAATCGCAGACCGATACCAGGATCTTGCCATCCTATGGAACTGCCTCGGTGAGT 

TTTCTCCTTCATTACAGAAACGGCTTTTTCAAAAATATGGTATTGATAATCCTGATATGA 

ATAAATTGCAGTTTCATTTGATGCTCGATGAGTTTTTCTAATCAGAATTGGTTAATTGGT 

TGTAACACTGGCAGAGCATTACGCTGACTTGACGGGACGGCGNCATGACCAAAATCCCTT 

AACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTT 

GAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAG 

CGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCA 

GCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCA 

AGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTG 

CCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGG 

CGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCT 

ACACCGAACTGAGATACCTACAGCGTGAGCATTGAGAAAGCGCCACGCTTCCCGAAGGGA 

GAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGC 

TTCCAGGGGGGAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTG 

AGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCCGAGCCTATGGAAAAACGCCAGCAACG 

CGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGT 

TATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCC 

GCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCCAATAC 

GCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGACAGGTTTC 

CCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTACCTCACTCATTAGG 

CACCCCAGGCTTTACACTTTATGCTTCCGGCTCCTATGTTGTGTGGAATTGTGAGCGGAT 

AACAATTTCACACAGGAAACAGCTATGACCATGATTACGCCAAGCTCGGAATTAACCCTC
ACTAAAGGGAACAAAAGCTGGTACCGATCCCGAGCTTTGCAAATTAAAGCCTTCGAGCGT
CCCAAAACCTTCTCAAGCAAGGTTTTCAGTATAATGTTACATGCGTACACGCGTCTGTAC 
AGAAAAAAAAGAAAAATTTGAAATATAAATAACGTTCTTAATACTAACATAACTATAAAA 

AAATAAATAGGGACCTAGACTTCAGGTTGTCTAACTCCTTCCTTTTCGGTTAGAGCGGAT

GTGGGGGGAGGGCGTGAATGTAAGCGTGACATAACTAATTACATGATATCGACAAAGGAA 

AAGGGGCCTGTTTACTCACAGGCTTTTTTCAAGTAGGTAATTAAGTCGTTTCTGTCTTTT 

TCCTTCTTCAACCCACCAAAGGCCATCTTGGTACTTTTTTTTTTTTTTTTTTTTTTTTTT 

TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT 

TTTTTTTTCATAGAAATAATACAGAAGTAGATGTTGAATTAGATTAAACTGAAGATATAT 

AATTTATTGGAAAATACATAGAGCTTTTTGTTGATGCGCTTAAGCGATCAATTCAACAAC 

ACCACCAGCAGCTCTGATTTTTTCTTCAGCCAACTTGGAGACGAATCTAGCTTTGACGAT 

AACTGGAACATTTGGAATTCTACCCTTACCCAAGATCTTACCGTAACCGGCTGCCAAAGT 

GTCAATAACTGGAGCAGTTTCCTTAGAAGCAGATTTCAAGTATTGGTCTCTCTTGTCTTC 

TGGGATCAATGTCCACAATTTGTCCAAGTTCAAGACTGGCTTCCAGAAATGAGCTTGTTG 

CTTGTGGAAGTATCTCATACCAACCTTACCGAAATAACCTGGATGGTATTTATCCATGTT 

AATTCTGTGGTGATGTTGACCACCGGCCATACCTCTACCACCGGGGTGCTTTCTGTGCTT 

ACCGATACGACCTTTACCGGCTGAGACGTGACCTCTGTGCTTTCTAGTCTTAGTGAATCT 

GGAAGGCATTCTTGATTAGTTGGATGATTGTTCTGGGATTTAATGCAAAAATCACTTAAG 

AAGGAAAATCAACGGAGAAAGCAAACGCCATCTTAAATATACGGGATACAGATGAAAGGG

TTTGAACCTATCTGGAAAATAGCATTAAACAAGCGAAAAACTGCGAGGAAAATTGTTTGC 

GTCTCTGCGGGCTATTCACGCGCCAGAGGAAAATAGGAAAAATAACAGGGCATTAGAAAA 

ATAATTTTGATTTTGGTAATGTGTGGGTCCTGGTGTACAGATGTTACATTGGTTACAGTA 

CTCTTGTTTTTGCTGTGTTTTTCGATGAATCTCCAAAATGGTTGTTAGCACATGGAAGAG 

TCACCGATGCTAAGTTATCTCTATGTAAGCTACGTGGCGTGACTTTTGATGAAGCCGCAC 

AAGAGATACAGGATTGGCAACTGCAAATAGAATCTGGGGATCCCCCCTCGAGATCCGGGA 

TCGAAGAAATGATGGTAAATGAAATAGGAAATCAAGGAGCATGAAGGCAAAAGACAAATA

TAAGGGTCGAACGAAAAATAAAGTGAAAAGTGTTGATATGATGTATTTGGCTTTGCGGCG 

CCGAAAAAACGAGTTTACGCAATTGCACAATCATGCTGACTCTGTGGCGGACCCGCGCTC 

TTGCCGGCCCGGCGATAACGCTGGGCGTGAGGCTGTGCCCGGCGGAGTTTTTTGCGCCTG 

CATTTTCCAAGGTTTACCCTGCGCTAAGGGGCGAGATTGGAGAAGCAATAAGAATGCCGG 

TTGGGGTTGCGATGATGACGACCACGACAACTGGTGTCATTATTTAAGTTGCCGAAAGAA 

CCTGAGTGCATTTGCAACATGAGTATACTAGAAGAATGAGCCAAGACTTGCGAGACGCGA 

GTTTGCCGGTGGTGCGAACAATAGAGCGACCATGACCTTGAAGGTGAGACGCGCATAACC 

GCTAGAGTACTTTGAAGAGGAAACAGCAATAGGGTTGCTACCAGTATAAATAGACAGGTA 

CATACAACACTGGAAATGGTTGTCTGTTTGAGTACGCTTTCAATTCATTTGGGTGTGCAC 

TTTATTATGTTACAATATGGAAGGGAACTTTACACTTCTCCTATGCACATATATTAATTA 

AAGTCCAATGCTAGTAGAGAAGGGGGGTAACACCCCTCCGCGCTCTTTTCCGATTTTTTT 

CTAAACCGTGGAATATTTCGGATATCCTTTTGTTGTTTCCGGGTGTACAATATGGACTTC 

CTCTTTTCTGGCAACCAAACCCATACATCGGGATTCCTATAATACCTTCGTTGGTCTCCC 

TAACATGTAGGTGGCGGAGGGGAGATATACAATAGAACAGATACCAGACAAGACATAATG 

GGCTAAACAAGACTACACCAATTACACTGCCTCATTGATGGTGGTACATAACGAACTAAT 

ACTGTAGCCCTAGACTTGATAGCCATCATCATATCGAAGTTTCACTACCCTTTTTCCATT 

TGCCATCTATTGAAGTAATAATAGGCGCATGCAACTTCTTTTCTTTTTTTTTCTTTTCTC 

TCTCCCCCGTTGTTGTCTCACCATATCCGCAATGACAAAAAAAATGATGGAAGACACTAA 

AGGAAAAAATTAACGACAAAGACAGCACCAACAGATGTCGTTGTTCCAGAGCTGATGAGG 

GGTATCTTCGAACACACGAAACTTTTTCCTTCCTTCATTCACGCACACTACTCTCTAATG 

AGCAACGGTATACGGCCTTCCTTCCAGTTACTTGAATTTGAAATAAAAAAAGTTTGCCGC

TTTGCTATCAAGTATAAATAGACCTGCAATTATTAATCTTTTGTTTCCTCGTCATTGTTC 

TCGTTCCCTTTCTTCCTTGTTTCTTTTTCTGCACAATATTTCAAGCTATACCAAGCATAC 

AATCAACTCCAAGCTTGAAGCAAGCCTCCTGAAAGATGAAGCTACTGTCTTCTATCGAAC 

AAGCATGCGATATTTGCCGACTTAAAAAGCTCAAGTGCTCCAAAGAAAAACCGAAGTGCG 

CCAAGTGTCTGAAGAACAACTGGGAGTGTCGCTACTCTCCCAAAACCAAAAGGTCTCCGC 

TGACTAGGGCACATCTGACAGAAGTGGAATCAAGGCTAGAAAGACTGGAACAGCTATTTC 

TACTGATTTTTCCTCGAGAAGACCTTGACATGATTTTGAAAATGGATTCTTTACAGGATA 

TAAAAGCATTGTTAACAGGATTATTTGTACAAGATAATGTGAATAAAGATGCCGTCACAG 

ATAGATTGGCTTCAGTGGAGACTGATATGCCTCTAACATTGAGACAGCATAGAATAAGTG 

CGACATCATCATCGGAAGAGAGTAGTAACAAAGGTCAAAGACAGTTGACTGTATCGTCGA

GGTCGAATCAAACAAGTTTGTACAAAAAAGCTGAACGAGAAACGTAAAATGATATAAATA
TCAATATATTAAATTAGATTTTGCATAAAAAACAGACTACATAATACTGTAAAACACAAC

ATATCCAGTCACTATGGCGGCCGCTAAGTTGGCAGCATCACCCGACGCACTTTGCGCCGA 

ATAAATACCTGTGACGGAAGATCACTTCGCAGAATAAATAAATCCTGGTGTCCCTGTTGA 

TACCGGGAAGCCCTGGGCCAACTTTTGGCGAAAATGAGACGTTGATCGGCACGTAAGAGG 

TTCCAACTTTCACCATAATGAAATAAGATCACTACCGGGCGTATTTTTTGAGTTATCGAG 

ATTTTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAAAATCACTGGATATACCACCGTTGA 

TATATCCCAATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTCAGTTGCTCAATGTAC 

CTATAACCAGACCGTTCAGCTGGATATTACGGCCTTTTTAAAGACCGTAAAGAAAAATAA 

GCACAAGTTTTATCCGGCCTTTATTCACATTCTTGCCCGCCTGATGAATGCTCATCCGGA 

ATTCCGTATGGCAATGAAAGACGGTGAGCTGGTGATATGGGATAGTGTTCACCCTTGTTA 

CACCGTTTTCCATGAGCAAACTGAAACGTTTTCATCGCTCTGGAGTGAATACCACGACGA 

TTTCCGGCAGTTTCTACACATATATTCGCAAGATGTGGCGTGTTACGGTGAAAACCTGGC 

CTATTTCCCTAAAGGGTTTATTGAGAATATGTTTTTCGTCTCAGCCAATCCCTGGGTGAG 

TTTCACCAGTTTTGATTTAAACGTGGCCAATATGGACAACTTCTTCGCCCCCGTTTTCAC 

CATGGGCAAATATTATACGCAAGGCGACAAGGTGCTGATGCCGCTGGCGATTCAGGTTCA 

TCATGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGCTTAATGAATTACAACAGTACTG 

CGATGAGTGGCAGGGCGGGGCGTAATCTAGAGGATCCGGCTTACTAAAAGCCAGATAACA 

GTATGCGTATTTGCGCGCTGATTTTTGCGGTATAAGAATATATACTGATATGTATACCCG 

AAGTATGTCAAAAAGAGGTGTGCTATGAAGCAGCGTATTACAGTGACAGTTGACAGCGAC 

AGCTATCAGTTGCTCAAGGCATATATGATGTCAATATCTCCGGTCTGGTAAGCACAACCA 

TGCAGAATGAAGCCCGTCGTCTGCGTGCCGAACGCTGGAAAGCGGAAAATCAGGAAGGGA 

TGGCTGAGGTCGCCCGGTTTATTGAAATGAACGGCTCTTTTGCTGACGAGAACAGGGACT 

GGTGAAATGCAGTTTAAGGTTTACACCTATAAAAGAGAGAGCCGTTATCGTCTGTTTGTG 

GATGTACAGAGTGATATTATTGACACGCCCGGGCGACGGATGGTGATCCCCCTGGCCAGT 

GCACGTCTGCTGTCAGATAAAGTCTCCCGTGAACTTTACCCGGTGGTGCATATCGGGGAT 

GAAAGCTGGCGCATGATGACCACCGATATGGCCAGTGTGCGGGTCTCCGTTATCGGGGAA 

GAAGTGGCTGATCTCAGCCACCGCGAAAATGACATCAAAAACGCCATTAACCTGATGTTC 

TGGGGAATATAAATGTCAGGCTCCCTTATACACAGCCAGTCTGCAGGTCGACCATAGTGA 

CTGGATATGTTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTAATTTAA 

TATATTGATATTTATATCATTTTACGTTTCTCGTTCAGCTTTCTTGTACAAAGTGGTTTG 

ATGGCCGCTAAGTAAGTAAGACGTCGAGCTCTAAGTAAGTAACGGCCGCCACCGCGGTGG 

AGCTTTGGACTTCTTCGCCAGAGGTTTGGTCAAGTCTCCAATCAAGGTTGTCGGCTTGTC 

TACCTTGCCAGAAATTTACGAAAAGATGGAAAAGGGTCAAATCGTTGGTAGATACGTTGT 

TGACACTTCTAAATAAGCGAATTTCTTATGATTTATGATTTTTATTATTAAATAAGTTAT 

AAAAAAAATAAGTGTATACAAATTTTAAAGTGACTCTTAGGTTTTAAAACGAAAATTCTT 

GTTCTTGAGTAACTCTTTCCTGTAGGTCAGGTTGCTTTCTCAGGTATAGCATGAGGTCGC 

TCTTATTGACCACACCTCTACCGGCATGCCGAGCAAATGCCTGCAAATCGCTCCCCATTT 

CACCCAATTGTAGATATGCTAACTCCAGCAATGAGTTGATGAATCTCGGTGTGTATTTTA 

TGTCCTCAGAGGACAATACCTGTTGTAATCGTTCTTCCACACGGATCCCAATTCGCCCTA 

TAGTGAGTCGTATTACAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCC 

TGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAG 

CGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGAC 

GCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCT 

ACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACG 

TTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGT

GCTTTACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCA 

TCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGA 

CTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAA 

GGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAAC 

GCGAATTTTAACAAAATATTAACGTTTACAATTTCCTGATGCGGTATTTTCTCCTTACGC 

ATCTGTGCGGTATTTCACACCGCATATCGACCGGTCGAGGAGAACTTCTAGTATATCCAC 

ATACCTAATATTATTGCCTTATTAAAAATGGAATCGGAACAATTACATCAAAATCCACAT 

TCTCTTCAAAATCAATTGTCCTGTACTTCCTTGTTCATGTGTGTTCAAAAACGTTATATT 

TATAGGATAATTATACTCTATTTCTCAACAAGTAATTGGTTGTTTGGCCGAGCGGTCTAA 

GGCGCCTGATTCAAGAAATATCTTGACCGCAGTTAACTGTGGGAATACTCAGGTATCGTA 

AGATGCAAGAGTTCGAATCTCTTAGCAACCATTATTTTTTTCCTCAACATAACGAGAACA 

CACAGGGGCGCTATCGCACAGAATCAAATTCGATGACTGGAAATTTTTTGTTAATTTCAG 

AGGTCGCCTGACGCATATACCTTTTTCAACTGAAAAATTGGGAGAAAAAGGAAAGGTGAG
AGGCCGGAACCGGCTTTTCATATAGAATAGAGAAGCGTTCATGACTAAATGCTTGCATCA

CAATACTTGAAGTTGACAATATTATTTAAGGACCTATTGTTTTTTCCAATAGGTGGTTAG

CAATCGTCTTACTTTCTAACTTTTCTTACCTTTTACATTTCAGCAATATATATATATATT 

TCAAGGATATACCATTCTAATGTCTGCCCCTATGTCTGCCCCTAAGAAGATCGTCGTTTT 

GCCAGGTGACCACGTTGGTCAAGAAATCACAGCCGAAGCCATTAAGGTTCTTAAAGCTAT 

TTCTGATGTTCGTTCCAATGTCAAGTTCGATTTCGAAAATCATTTAATTGGTGGTGCTGC 

TATCGATGCTACAGGTGTCCCACTTCCAGATGAGGCGCTGGAAGCCTCCAAGAAGGTTGA 

TGCCGTTTTGTTAGGTGCTGTGGGTGGTCCTAAATGGGGTACCGGTAGTGTTAGACCTGA 

ACAAGGTTTACTAAAAATCCGTAAAGAACTTCAATTGTACGCCAACTTAAGACCATGTAA

CTTTGCATCCGACTCTCTTTTAGACTTATCTCCAATCAAGCCACAATTTGCTAAAGGTAC 

TGACTTCGTTGTTGTCAGAGAATTAGTGGGAGGTATTTACTTTGGTAAGAGAAAGGAAGA 

CGATGGTGATGGTGTCGCTTGGGATAGTGAACAATACACCGTTCCAGAAGTGCAAAGAAT 

CACAAGAATGGCCGCTTTCATGGCCCTACAACATGAGCCACCATTGCCTATTTGGTCCTT 

GGATAAAGCTAATGTTTTGGCCTCTTCAAGATTATGGAGAAAAACTGTGGAGGAAACCAT 

CAAGAACGAATTCCCTACATTGAAGGTTCAACATCAATTGATTGATTCTGCCGCCATGAT 

CCTAGTTAAGAACCCAACCCACCTAAATGGTATTATAATCACCAGCAACATGTTTGGTGA 

TATCATCTCCGATGAAGCCTCCGTTATCCCAGGTTCCTTGGGTTTGTTGCCATCTGCGTC 

CTTGGCCTCTTTGCCAGACAAGAACACCGCATTTGGTTTGTACGAACCATGCCACGGTTC 

TGCTCCAGATTTGCCAAAGAATAAGGTTGACCCTATCGCCACTATCTTGTCTGCTGCAAT 

GATGTTGAAATTGTCATTGAACTTGCCTGAAGAAGGTAAGGCCATTGAAGATGCAGTTAA 

AAAGGTTTTGGATGCAGGTATCAGAACTGGTGATTTAGGTGGTTCCAACAGTACCACCGA 

AGTCGGTGATGCTGTCGCCGAAGAAGTTAAGAAAATCCTTGCTTAAAAAGATTCTCTTTT 

TTTATGATATTTGTACATAAACTTTATAAATGAAATTCATAATAGAAACGACACGAAATT 

ACAAAATGGAATATGTTCATAGGGTAGACGAAACTATATACGCAATCTACATACATTTAT

CAAGAAGGAGAAAAAGGAGGATAGTAAAGGAATACAGGTAAGCAAATTGATACTAATGGC 

TCAACGTGATAAGGAAAAAGAATTGCACTTTAACATTAATATTGACAAGGAGGAGGGCAC 

CACACAAAAAGTTAGGTGTAACAGAAAATCATGAAACTACGATTCCTAATTTGATATTGG 

AGGATTTTCTCTAAAAAAAAAAAAATACAACAAATAAAAAACACTCAATGACCTGACCAT 

TTGATGGAGTTTAAGTCAATACCTTCTTGAACCATTTCCCATAATGGTGAAAGTTCCCTC 

AAGAATTTTACTCTGTCAGAAACGGCCTTACGACGTAGTCGATATGGTGCACTCTCAGTA 

CAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGCTGACG 

CGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCG 

GGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAACGCGCGA

pDEST33 8815 bp

GCCTTACGCATCTGTGCGGTATTTCACACCGCAGGCAAGTGCACAAACAATACTTAAATA

AATACTACTCAGTAATAACCTATTTCTTAGCATTTTTGACGAAATTTGCTATTTTGTTAG 

AGTCTTTTACACCATTTGTCTCCACACCTCCGCTTACATCAACACCAATAACGCCATTTA 

ATCTAAGCGCATCACCAACATTTTCTGGCGTCAGTCCACCAGCTAACATAAAATGTAAGC 

TTTCGGGGCTCTCTTGCCTTCCAACCCAGTCAGAAATCGAGTTCCAATCCAAAAGTTCAC 

CTGTCCCACCTGCTTCTGAATCAAACAAGGGAATAAACGAATGAGGTTTCTGTGAAGCTG 

CACTGAGTAGTATGTTGCAGTCTTTTGGAAATACGAGTCTTTTAATAACTGGCAAACCGA 

GGAACTCTTGGTATTCTTGCCACGACTCATCTCCATGCAGTTGGACGATATCAATGCCGT 

AATCATTGACCAGAGCCAAAACATCCTCCTTAGGTTGATTACGAAACACGCCAACCAAGT 

ATTTCGGAGTGCCTGAACTATTTTTATATGCTTTTACAAGACTTGAAATTTTCCTTGCAA 

TAACCGGGTCAATTGTTCTCTTTCTATTGGGCACACATATAATACCCAGCAAGTCAGCAT 

CGGAATCTAGAGCACATTCTGCGGCCTCTGTGCTCTGCAAGCCGCAAACTTTCACCAATG 

GACCAGAACTACCTGTGAAATTAATAACAGACATACTCCAAGCTGCCTTTGTGTGCTTAA 

TCACGTATACTCACGTGCTCAATAGTCACCAATGCCCTCCCTCTTGGCCCTCTCCTTTTC 

TTTTTTCGACCGAATTAATTCTTAATCGGCAAAAAAAGAAAAGCTCCGGATCAAGATTGT

ACGTAAGGTGACAAGCTATTTTTCAATAAAGAATATCTTCCACTACTGCCATCTGGCGTC 

ATAACTGCAAAGTACACATATATTACGATGCTGTCTATTAAATGCTTCCTATATTATATA 

TATAGTAATGTCGTTTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAA 

GCCAGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGG 

CATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCAC 

CGTCATCACCGAAACGCGCGAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTA 

ATGTCATGATAATAATGGTTTCTTAGGACGGATCGCTTGCCTGTAACTTACACGCGCCTC 

GTATCTTTTAATGATGGAATAATTTGGGAATTTACTCTGTGTTTATTTATTTTTATGTTT 

TGTATTTGGATTTTAGAAAGTAAATAAAGAAGGTAGAAGAGTTACGGAATGAAGAAAAAA 

AAATAAACAAAGGTTTAAAAAATTTCAACAAAAAGCGTACTTTACATATATATTTATTAG 

ACAAGAAAAGCAGATTAAATAGATATACATTCGATTAACGATAAGTAAAATGTAAAATCA 

CAGGATTTTCGTGTGTGGTCTTCTACACAGACAAGATGAAACAATTCGGCATTAATACCT 

GAGAGCAGGAAGAGCAAGATAAAAGGTAGTATTTGTTGGCGATCCCCCTAGAGTCTTTTA 

CATCTTCGGAAAACAAAAACTATTTTTTCTTTAATTTCTTTTTTTACTTTCTATTTTTAA 

TTTATATATTTATATTAAAAAATTTAAATTATAATTATTTTTATAGCACGTGATGAAAAG 

GACCCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAA 

ATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATAT 

TGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCG 

GCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAA 

GATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTT 

GAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGT

GGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTAT 

TCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATG 

ACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTA 

CTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTTCACAACATGGGGGAT 

CATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAG

CGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAA 

CTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCA 

GGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCC 

GGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGT 

ATCGTAGTTATCTACACGACGGGCAGTCAGGCAACTATGGATGAACGAAATAGACAGATC 

GCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATAT 

ATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTT 

TTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGAC 

CCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGC 

TTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCA 

ACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTA 

GTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCT 

CTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTG 

GACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGC 

ACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCAT
TGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGG

GTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGGAACGCCTGGTATCTTTATAGT 

CCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGG 

CCGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGG 

CCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACC 

GCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTG 

AGCGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATT 

CATTAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCA 

ATTAATGTGAGTTACCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCT 

CCTATGTTGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCAT 

GATTACGCCAAGCTCGGAATTAACCCTCACTAAAGGGAACAAAAGCTGGGTACCGGGCCC 

CCCCTCGAGATCCGGGATCGAAGAAATGATGGTAAATGAAATAGGAAATCAAGGAGCATG 

AAGGCAAAAGACAAATATAAGGGTCGAACGAAAAATAAAGTGAAAAGTGTTGATATGATG 

TATTTGGCTTTGCGGCGCCGAAAAAACGAGTTTACGCAATTGCACAATCATGCTGACTCT 

GTGGCGGACCCGCGCTCTTGCCGGCCCGGCGATAACGCTGGGCGTGAGGCTGTGCCCGGC 

GGAGTTTTTTGCGCCTGCATTTTCCAAGGTTTACCCTGCGCTAAGGGGCGAGATTGGAGA 

AGCAATAAGAATGCCGGTTGGGGTTGCGATGATGACGACCACGACAACTGGTGTCATTAT 

TTAAGTTGCCGAAAGAACCTGAGTGCATTTGCAACATGAGTATACTAGAAGAATGAGCCA 

AGACTTGCGAGACGCGAGTTTGCCGGTGGTGCGAACAATAGAGCGACCATGACCTTGAAG 

GTGAGACGCGCATAACCGCTAGAGTACTTTGAAGAGGAAACAGCAATAGGGTTGCTACCA 

GTATAAATAGACAGGTACATACAACACTGGAAATGGTTGTCTGTTTGAGTACGCTTTCAA 

TTCATTTGGGTGTGCACTTTATTATGTTACAATATGGAAGGGAACTTTACACTTCTCCTA 

TGCACATATATTAATTAAAGTCCAATGCTAGTAGAGAAGGGGGGTAACACCCCTCCGCGC 

TCTTTTCCGATTTTTTTCTAAACCGTGGAATATTTCGGATATCCTTTTGTTGTTTCCGGG 

TGTACAATATGGACTTCCTCTTTTCTGGCAACCAAACCCATACATCGGGATTCCTATAAT 

ACCTTCGTTGGTCTCCCTAACATGTAGGTGGCGGAGGGGAGATATACAATAGAACAGATA 

CCAGACAAGACATAATGGGCTAAACAAGACTACACCAATTACACTGCCTCATTGATGGTG 

GTACATAACGAACTAATACTGTAGCCCTAGACTTGATAGCCATCATCATATCGAAGTTTC 

ACTACCCTTTTTCCATTTGCCATCTATTGAAGTAATAATAGGCGCATGCAACTTCTTTTC 

TTTTTTTTTCTTTTCTCTCTCCCCCGTTGTTGTCTCACCATATCCGCAATGACAAAAAAA 

ATGATGGAAGACACTAAAGGAAAAAATTAACGACAAAGACAGCACCAACAGATGTCGTTG 

TTCCAGAGCTGATGAGGGGTATCTTCGAACACACGAAACTTTTTCCTTCCTTCATTCACG 

CACACTACTCTCTAATGAGCAACGGTATACGGCCTTCCTTCCAGTTACTTGAATTTGAAA 

TAAAAAAAGTTTGCCGCTTTGCTATCAAGTATAAATAGACCTGCAATTATTAATCTTTTG 

TTTCCTCGTCATTGTTCTCGTTCCCTTTCTTCCTTGTTTCTTTTTCTGCACAATATTTCA 

AGCTATACCAAGCATACAATCAACTCCAAGCTTATGCCCAAGAAGAAGCGGAAGGTCTCG 

AGCGGCGCCAATTTTAATCAAAGTGGGAATATTGCTGATAGCTCATTGTCCTTCACTTTC 

ACTAACAGTAGCAACGGTCCGAACCTCATAACAACTCAAACAAATTCTCAAGCGCTTTCA 

CAACCAATTGCCTCCTCTAACGTTCATGATAACTTCATGAATAATGAAATCACGGCTAGT 

AAAATTGATGATGGTAATAATTCAAAACCACTGTCACCTGGTTGGACGGACCAAACTGCG 

TATAACGCGTTTGGAATCACTACAGGGATGTTTAATACCACTACAATGGATGATGTATAT 

AACTATCTATTCGATGATGAAGATACCCCACCAAACCCAAAAAAAGAGGGTGGGTCGAAT 

CAAACAAGTTTGTACAAAAAAGCTGAACGAGAAACGTAAAATGATATAAATATCAATATA

TTAAATTAGATTTTGCATAAAAAACAGACTACATAATAGTGTAAAACACAACATATCCAG

TCACTATGGCGGCCGCTAAGTTGGCAGCATCACCCGACGCACTTTGCGCCGAATAAATAC 

CTGTGACGGAAGATCACTTCGCAGAATAAATAAATCCTGGTGTCCCTGTTGATACCGGGA 

AGCCCTGGGCCAACTTTTGGCGAAAATGAGACGTTGATCGGCACGTAAGAGGTTCCAACT 

TTCACCATAATGAAATAAGATCACTACCGGGCGTATTTTTTGAGTTATCGAGATTTTCAG 

GAGCTAAGGAAGCTAAAATGGAGAAAAAAATCACTGGATATACCACCGTTGATATATCCC 

AATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTCAGTTGCTCAATGTACCTATAACC 

AGACCGTTCAGCTGGATATTACGGCCTTTTTAAAGACCGTAAAGAAAAATAAGCACAAGT 

TTTATCCGGCCTTTATTCACATTCTTGCCCGCCTGATGAATGCTCATCCGGAATTCCGTA 

TGGCAATGAAAGACGGTGAGCTGGTGATATGGGATAGTGTTCACCCTTGTTACACCGTTT 

TCCATGAGCAAACTGAAACGTTTTCATCGCTCTGGAGTGAATACCACGACGATTTCCGGC 

AGTTTCTACACATATATTCGCAAGATGTGGCGTGTTACGGTGAAAACCTGGCCTATTTCC 

CTAAAGGGTTTATTGAGAATATGTTTTTCGTCTCAGCCAATCCCTGGGTGAGTTTCACCA 

GTTTTGATTTAAACGTGGCCAATATGGACAACTTCTTCGCCCCCGTTTTCACCATGGGCA 

AATATTATACGCAAGGCGACAAGGTGCTGATGCCGCTGGCGATTCAGGTTCATCATGCCG
TCTGTGATGGCTTCCATGTCGGCAGAATGCTTAATGAATTACAACAGTACTGCGATGAGT

GGCAGGGCGGGGCGTAATCTAGAGGATCCGGCTTACTAAAAGCCAGATAACAGTATGCGT 

ATTTGCGCGCTGATTTTTGCGGTATAAGAATATATACTGATATGTATACCCGAAGTATGT 

CAAAAAGAGGTGTGCTATGAAGCAGCGTATTACAGTGACAGTTGACAGCGACAGCTATCA 

GTTGCTCAAGGCATATATGATGTCAATATCTCCGGTCTGGTAAGCACAACCATGCAGAAT 

GAAGCCCGTCGTCTGCGTGCCGAACGCTGGAAAGCGGAAAATCAGGAAGGGATGGCTGAG 

GTCGCCCGGTTTATTGAAATGAACGGCTCTTTTGCTGACGAGAACAGGGACTGGTGAAAT 

GCAGTTTAAGGTTTACACCTATAAAAGAGAGAGCCGTTATCGTCTGTTTGTGGATGTACA 

GAGTGATATTATTGACACGCCCGGGCGACGGATGGTGATGCCCCTGGCCAGTGCACGTCT 

GCTGTCAGATAAAGTCTCCCGTGAACTTTACCCGGTGGTGCATATCGGGGATGAAAGCTG 

GCGCATGATGACCACCGATATGGCCAGTGTGCCGGTCTCCGTTATCGGGGAAGAAGTGGC 

TGATCTCAGCCACCGCGAAAATGACATCAAAAACGCCATTAACCTGATGTTCTGGGGAAT 

ATAAATGTCAGGCTCCGTTATACACAGCCAGTCTGCAGGTCGACCATAGTGACTGGATAT 

GTTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTAATTTAATATATTGA 

TATTTATATCATTTTACGTTTCTCGTTCAGCTTTCTTGTACAAAGTGGTTTGATGGCCGC 

TAAGTAAGTAAGACGTCGAGCTCCCTATAGTGAGTCGTATTACACTGGCCGTCGTTTTAC 

AACGTCGTGACTGGGAAAACACCGGTGAGCTCTAAGTAAGTAACGGCCGCCACCGCGGTG 

GAGCTTTGGACTTCTTCGCCAGAGGTTTGGTCAAGTGTCCAATCAAGGTTGTCGGCTTGT 

CTACCTTGCCAGAAATTTACGAAAAGATGGAAAAGGGTCAAATCGTTGGTAGATACGTTG 

TTGACACTTCTAAATAAGCGAATTTCTTATGATTTATGATTTTTATTATTAAATAAGTTA 

TAAAAAAAATAAGTGTATACAAATTTTAAAGTGACTCTTAGGTTTTAAAACGAAAATTCT 

TGTTCTTGAGTAACTCTTTCCTGTAGGTCAGGTTGCTTTCTCAGGTATAGCATGAGGTCG 

CTCTTATTGACCACACCTCTACCGGCATGCCGAGCAAATGCCTGCAAATCGCTCCCCATT 

TCACCCAATTGTAGATATGCTAACTCCAGCAATGAGTTGATGAATCTCGGTGTGTATTTT 

ATGTCCTCAGAGGACAATACCTGTTGTAATCGTTCTTCCACACGGATCCGCATCAGGCGA 

AATTGTAAACGTTAATATTTTGTTAAAATTCGCGTTAAATATTTGTTAAATCAGCTCATT 

TTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAGACCGAGAT

AGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTGGACTCCAA 

CGTCAAAGGGCGAAAAACCGTCTATCAGGGCGATGGCGCACTACGTGAACCATCACCCTA 

ATCAAGTTTTTTGGGGTCGAGGTGCCGTAAAGCACTAAATCGGAACCCTAAAGGGAGCCC 

CCGATTTAGAGCTTGACGGGGAAAGCCGGCGAACGTGGCGAGAAAGGAAGGGAAGAAAGC 

GAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCGGTCACGCTGCGCGTAACCACCAC 

ACCCGCCGCGCTTAATGCGCCGCTACAGGGCGCGTCCCATTCGCCATTCACTGCA

pDEST34 7114 bp

Location (Base Nos.)    Gene Encoded

195..71                 attR1
304..963                CmR
1305..1610              ccdB
1651..1775              attR2
1780..2472              GST
2675..2720              T7stop
3334..4194              ampR
4343..4982              ori

ATCGAGATCTCGATCCCGCGAAATTAATACGACTCACTATAGGGAGACCACAACGGTTTC 

CCTCTAGATCACAAGTTTGTACAAAAAAGCTGAACGAGAAACGTAAAATGATATAAATAT

CAATATATTAAATTAGATTTTGCATAAAAAACAGACTACATAATACTGTAAAACACAACA 

TATCCAGTCACTATGGCGGCCGCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGC 

TCGTATAATGTGTGGATTTTGAGTTAGGATCCGGCGAGATTTTCAGGAGCTAAGGAAGCT 

AAAATGGAGAAAAAAATCACTGGATATACCACCGTTGATATATCCCAATGGCATCGTAAA

GAACATTTTGAGGCATTTCAGTCAGTTGCTCAATGTACCTATAACCAGACCGTTCAGCTG 

GATATTACGGCCTTTTTAAAGACCGTAAAGAAAAATAAGCACAAGTTTTATCCGGCCTTT 

ATTCACATTCTTGCCCGCCTGATGAATGCTCATCCGGAATTCCGTATGGCAATGAAAGAC 

GGTGAGCTGGTGATATGGGATAGTGTTCACCCTTGTTACACCGTTTTCCATGAGCAAACT

GAAACGTTTTCATCGCTCTGGAGTGAATACCACGACGATTTCCGGCAGTTTCTACACATA 

TATTCGCAAGATGTGGCGTGTTACGGTGAAAACCTGGCCTATTTCCCTAAAGGGTTTATT 

GAGAATATGTTTTTCGTCTCAGCCAATCCCTGGGTGAGTTTCACCAGTTTTGATTTAAAC 

GTGGCCAATATGGACAACTTCTTCGCCCCCGTTTTCACCATGGGCAAATATTATACGCAA 

GGCGACAAGGTGCTGATGCCGCTGGCGATTCAGGTTCATCATGCCGTCTGTGATGGCTTC 

CATGTCGGCAGAATGCTTAATGAATTACAACAGTACTGCGATGAGTGGCAGGGCGGGGCG 

TAAACGCGTGGATCCGGCTTACTAAAAGCCAGATAACAGTATGCGTATTTGCGCGCTGAT 

TTTTGCGGTATAAGAATATATACTGATATGTATACCCGAAGTATGTCAAAAAGAGGTGTG 

CTATGAAGCAGCGTATTACAGTGACAGTTGACAGCGACAGCTATCAGTTGCTCAAGGCAT 

ATATGATGTCAATATCTCCGGTCTGGTAAGCACAACCATGCAGAATGAAGCCCGTCGTCT 

GCGTGCCGAACGCTGGAAAGCGGAAAATCAGGAAGGGATGGCTGAGGTCGCCCGGTTTAT 

TGAAATGAACGGCTCTTTTGCTGACGAGAACAGGGACTGGTGAAATGCAGTTTAAGGTTT 

ACACCTATAAAAGAGAGAGCCGTTATCGTCTGTTTGTGGATGTACAGAGTGATATTATTG 

ACACGCCCGGGCGACGGATGGTGATCCCCCTGGCCAGTGCACGTCTGCTGTCAGATAAAG 

TCTCCCGTGAACTTTACCCGGTGGTGCATATCGGGGATGAAAGCTGGCGCATGATGACCA 

CCGATATGGCCAGTGTGCCGGTCTCCGTTATCGGGGAAGAAGTGGCTGATCTCAGCCACC 

GCGAAAATGACATCAAAAACGCCATTAACCTGATGTTCTGGGGAATATAAATGTCAGGCT

CCCTTATACACAGCCAGTCTGCAGGTCGACCATAGTGACTGGATATGTTGTGTTTTACAG 

TATTATGTAGTCTGTTTTTTATGCAAAATCTAATTTAATATATTGATATTTATATCATTT 

TACGTTTCTCGTTCAGCTTTCTTGTACAAAGTGGTGATTATGTCCCCTATACTAGGTTAT 

TGGAAAATTAAGGGCCTTGTGCAACCCACTCGACTTCTTTTGGAATATCTTGAAGAAAAA 

TATGAAGAGCATTTGTATGAGCGCGATGAAGGTGATAAATGGCGAAACAAAAAGTTTGAA 

TTGGGTTTGGAGTTTCCCAATCTTCCTTATTATATTGATGGTGATGTTAAATTAACACAG 

TCTATGGCCATCATACGTTATATAGCTGACAAGCACAACATGTTGGGTGGTTGTCCAAAA 

GAGGGTGCAGAGATTTCAATGCTTGAAGGAGCGGTTTTGGATATTAGATACGGTGTTTCG 

AGAATTGCATATAGTAAAGACTTTGAAACTCTCAAAGTTGATTTTCTTAGCAAGCTACCT 

GAAATGCTGAAAATGTTCGAAGATCGTTTATGTCATAAAACATATTTAAATGGTGATCAT 

GTAACCCATCCTGACTTCATGTTGTATGACGCTCTTGATGTTGTTTTATACATGGACCCA 

ATGTGCCTGGATGCGTTCCCAAAATTAGTTTGTTTTAAAAAACGTATTGAAGCTATCCCA 

CAAATTGATAAGTACTTGAAATCCAGCAAGTATATAGCATGGCCTTTGCAGGGCTGGCAA 

GCCACGTTTGGTGGTGGCGACCATCCTCCAAAATCGGATCTGGTTCCGCGTCCATGGGGA 

TCCGGCTGCTAACAAAGCCCGAAAGGAAGCTGAGTTGGCTGCTGCCACCGCTGAGCGCTT 

CCCGATAAGGGAGCAGGCCAGTAAAAGCATTACCCGTGGTGGGGTTCCCGAGCGGCCAAA 

GGGAGCAGACTCTAAATCTGCCGTCATCGACTTCGAAGGTTCGAATCCTTCCCCCACCAC 

CATCACTTTCAAAAGTGAATTCGCTGAGCAATAACTAGCATAACCCCTTGGGGCCTCTAA
ACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGGAACTATATCCGGATATCCACAGGACGG

GTGTGGTCGCCATGATCGCGTAGTCGATAGTGGCTCCAAGTAGCGAAGCGAGCAGGACTG 

GGCGGCGGCCAAAGCGGTCGGACAGTGCTCCGAGAACGGGTGCGCATAGAAATTGCATCA 

ACGCATATAGCGCTAGCAGCACGCCATAGTGACTGGCGATGCTGTCGGAATGGACGATAT 

CCCGCAAGAGGCCCGGCAGTACCGGCATAACCAAGCCTATGCCTACAGCATCCAGGGTGA

CGGTGCCGAGGATGACGATGAGCGCATTGTTAGATTTCATACACGGTGCCTGACTGCGTT 

AGCAATTTAACTGTGATAAACTACCGCATTAAAGCTTATCGATGATAAGCTGTCAAAGAT 

GAGAATTCTTGAAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATG 

ATAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCT 

ATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGA 

TAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCC

CTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTG 

AAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTC 

AACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACT 

TTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTGTTGACGCCGGGCAAGAGCAACTC 

GGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAG 

CATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGAT 

AACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTT 

TTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAA 

GCCATACCAAACGACGAGCGTGACACCACGATGCCTGCAGCAATGGCAACAACGTTGCGC 

AAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATG 

GAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATT 

GCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCA 

GATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGAT 

GAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCA 

GACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGG 

ATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCG 

TTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTT 

CTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTG 

CCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATA 

CCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCA 

CCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAG 

TCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGC 

TGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGA 

TACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGG 

TATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAAC 

GCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTG 

TGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGG 

TTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCT 

GTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACC 

GAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCTGATGCGGTATTTTCTCCTT 

ACGCATCTGTGCGGTATTTCACACCGCATATATGGTGCACTCTCAGTACAATCTGCTCTG 

ATGCCGCATAGTTAAGCCAGTATACACTCCGCTATCGCTACGTGACTGGGTCATGGCTGC 

GCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATC 

CGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTC 

ATCACCGAAACGCGCGAGGCAGCTGCGGTAAAGCTCATCAGCGTGGTCGTGAAGCGATTC

ACAGATGTCTGCCTGTTCATCCGCGTCCAGCTCGTTGAGTTTCTCCAGAAGCGTTAATGT 

CTGGCTTCTGATAAAGCGGGCCATGTTAAGGGCGGTTTTTTCCTGTTTGGTCACTGATGC 

CTCCGTGTAAGGGGGATTTCTGTTCATGGGGGTAATGATACCGATGAAACGAGAGAGGAT 

GCTCACGATACGGGTTACTGATGATGAACATGCCCGGTTACTGGAACGTTGTGAGGGTAA 

ACAACTGGCGGTATGGATGCGGCGGGACCAGAGAAAAATCACTCAGGGTCAATGCCAGCG 

CTTCGTTAATACAGATGTAGGTGTTCCACAGGGTAGCCAGCAGCATCCTGCGATGCAGAT 

CCGGAACATAATGGTGCAGGGCGCTGACTTCCGCGTTTCCAGACTTTACGAAACACGGAA 

ACCGAAGACCATTCATGTTGTTGCTCAGGTCGCAGACGTTTTGCAGCAGCAGTCGCTTCA 

CGTTCGCTCGCGTATCGGTGATTCATTCTGCTAACCAGTAAGGCAACCCCGCCAGCCTAG 

CCGGGTCCTCAACGACAGGAGCACGATCATGCGCACCCGTGGCCAGGACCCAACGCTGCC 

CGAGATGCGCCGCGTGCGGCTGCTGGAGATGGCGGACGCGATGGATATGTTCTGCCAAGG 

GTTGGTTTGCGCATTCACAGTTCTCCGCAAGAATTGATTGGCTCCAATTCTTGGAGTGGT- 
GAATCCGTTAGCGAGGTGCCGCCGGCTTCCATTCAGGTCGAGGTGGCCCGGCTCCATGCA
CCGCGACGCAACGCGGGGAGGCAGACAAGGTATAGGGCGGCGCCTACAATCCATGCCAAC 
CCGTTCCATGTGCTCGCCGAGGCGGCATAAATCGCCGTGACGATCAGCGGTCCAGTGATC 
GAAGTTAGGCTGGTAAGAGCCGCGAGCGATCCTTGAAGCTGTCCCTGATGGTCGTCATCT 
ACCTGCCTGGACAGCATGGCCTGCAACGCGGGCATCCCGATGCCGCCGGAAGCGAGAAGA 
ATCATAATGGGGAAGGCCATCCAGCCTCGCGTCGCGAACGCCAGCAAGACGTAGCCCAGC 
GCGTCGGCCGCCATGCCGGCGATAATGGCCTGCTTCTCGCCGAAACGTTTGGTGGCGGGA 
CCAGTGACGAAGGCTTGAGCGAGGGCGTGCAAGATTCCGAATACCGCAAGCGACAGGCCG 
ATCATCGTCGCGCTCCAGCGAAAGCGGTCCTCGCCGAAAATGACCCAGAGCGCTGCCGGC 
ACCTGTCCTACGAGTTGCATGATAAAGAAGACAGTCATAAGTGCGGCGACGATAGTCATG 
CCCCGCGCCCACCGGAAGGAGCTGACTGGGTTGAAGGCTCTCAAGGGCATCGGTCGATCG
ACGCTCTCCCTTATGCGACTCCTGCATTAGGAAGCAGCCCAGTAGTAGGTTGAGGCCGTT 
GAGCACCGCCGCCGCAAGGAATGGTGCATGCAAGGAGATGGCGCCCAACAGTCCCCCGGC 
CACGGGGCCTGCCACCATACCCACGCCGAAACAAGCGCTCATGAGCCCGAAGTGGCGAGC 
CCGATCTTCCCCATCGGTGATGTCGGCGATATAGGCGCCAGCAACCGCACCTGTGGCGCC 
GGTGATGCCGGCCACGATGCGTCCGGCGTAGAGG 
pDONR207 5584 bp

GCGAGAGTAGGGAACTGCCAGGCATCAAATAAAACGAAAGGCTCAGTCGGAAGACTGGGC 

CTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTCCTGAGTAGGACAAATCCGCCGGG 

AGCGGATTTGAACGTTGTGAAGCAACGGCCCGGAGGGTGGCGGGCAGGACGCCCGCCATA 

AACTGCCAGGCATCAAACTAAGCAGAAGGCCATCCTGACGGATGGCCTTTTTGCGTTTCT 

ACAAACTCTTCCTGGCTAGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGA 

AAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTG 

GCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAG 

AGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTC 

GTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCG 

GGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTT 

CGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCC 

GGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCC 

ACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGG 

TGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCA 

GTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGC 

GGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGAT 

CCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATT 

TTGGTCATGAGCTTGCGCCGTCCCGTCAAGTCAGCGTAATGCTCTGCCAGTGTTACAACC 

AATTAACCAATTCTGATTAGAAAAACTCATCGAGCATCAAATGAAACTGCAATTTATTCA 

TATCAGGATTATCAATACCATATTTTTGAAAAAGCCGTTTCTGTAATGAAGGAGAAAACT 

CACCGAGGCAGTTCCATAGGATGGCAAGATCCTGGTATCGGTCTGCGATTCCGACTCGTC 

CAACATCAATACAACCTATTAGTAGCCAACCACTAGAACTATAGCTAGAGTCCTGGGCGA 

ACAAACGATGCTCGCCTTCCAGAAAACCGAGGATGCGAACCACTTCATCCGGGGTCAGCA 

CCACCGGCAAGCGCCGCGACGGCCGAGGTCTTCCGATCTCCTGAAGCCAGGGCAGATCCG 

TGCACAGCACCTTGCCGTAGAAGAACAGCAAGGCCGCCAATGCCTGACGATGCGTGGAGA 

CCGAAACCTTGCGCTCGTTCGCCAGCCAGGACAGAAATGCCTCGACTTCGCTGCTGCCCA 

AGGTTGCCGGGTGACGCACACCGTGGAAACGGATGAAGGCACGAACCCAGTTGACATAAG 

CCTGTTCGGTTCGTAAACTGTAATGCAAGTAGCGTATGCGCTCACG^AACTGGTCCAGAA 

CCTTGACCGAACGCAGCGGTGGTAACGGCGCAGTGGCGGTTTTCATGGCTTGTTATGACT 

GTTTTTTTGTACAGTCTATGCCTCGGGCATCCAAGCAGCAAGCGGGTTACGCCGTGGGTC 

GATGTTTGATGTTATGGAGCAGCAACGATGTTACGCAGCAGCAACGATGTTACGCAGCAG 

GGCAGTCGCCCTAAAACAAAGTTAGGTGGCTCAAGTATGGGCATCATTCGCACATGTAGG 

CTCGGCCCTGACCAAGTCAAATCCATGCGGGCTGCTCTTGATCTTTTCGGTCGTGAGTTC 

GGAGACGTAGCCACCTACTCCCAACATCAGCCGGACTCCGATTACCTCGGGAACTTGCTC 

CGTAGTAAGACATTCATCGCGCTTGCTGCCTTCGACCAAGAAGCGGTTGTTGGCGCTCTC 

GCGGCTTACGTTCTGCCCAGGTTTGAGCAGCCGCGTAGTGAGATCTATATCTATGATCTC 

GCAGTCTCCGGCGAGCACCGGAGGCAGGGCATTGCCACCGCGCTCATCAATCTCCTCAAG 

CATGAGGCCAACGCGCTTGGTGCTTATGTGATCTACGTGCAAGCAGATTACGGTGACGAT 

CCCGCAGTGGCTCTCTATACAAAGTTGGGCATACGGGAAGAAGTGATGCACTTTGATATC 

GACCCAAGTACCGCCACCTAACAATTCGTTCAAGCCGAGATCGGCTTCCCGGCCTAATTT 

CCCCTCGTCAAAAATAAGGTTATCAAGTGAGAAATCACCATGAGTGACGACTGAATCCGG

TGAGAATGGCAAAAGTTTATGCATTTCTTTCCAGACTTGTTCAACAGGCCAGCCATTACG 

CTCGTCATCAAAATCACTCGCATCAACCAAACCGTTATTCATTCGTGATTGCGCCTGAGC

GAGACGAAATACGCGATCGCTGTTAAAAGGACAATTACAAACAGGAATCGAATGCAACCG 

GCGCAGGAACACTGCCAGCGCATCAACAATATTTTCACCTGAATCAGGATATTCTTCTAA

TACCTGGAATGCTGTTTTTCCGGGGATCGCAGTGGTGAGTAACCATGCATCATCAGGAGT 

ACGGATAAAATGCTTGATGGTCGGAAGAGGCATAAATTCCGTCAGCCAGTTTAGTCTGAC 

CATCTCATCTGTAACATCATTGGCAACGCTACCTTTGCCATGTTTCAGAAACAACTCTGG 

CGCATCGGGCTTCCCATACAAGCGATAGATTGTCGCACCTGATTGCCCGACATTATCGCG 

AGCCCATTTATACCCATATAAATCAGCATCCATGTTGGAATTTAATCGCGGCCTCGACGT

TTCCCGTTGAATATGGCTCATAACACCCCCTGTATTACTGTTTATGTAAGCAGACAGTTT 

TATTGTTCATGATGATATATTTTTATCTTGTGCAATGTAACATCAGAGATTTTGAGACAC 

GGGCCAGAGCTGCAGCTGGATGGCAAATAATGATTTTATTTTGACTGATAGTGACCTGTT 

CGTTGCAACAAATTGATAAGCAATGCTTTCTTATAATGCCAACTTTGTACAAGAAAGCTG 

AACGAGAAACGTAAAATGATATAAATATCAATATATTAAATTAGATTTTGCATAAAAAAC 

AGACTACATAATACTGTAAAACACAACATATCCAGTCACTATGAATCAACTACTTAGATG- 
GTATTAGTGACCTGTAGTCGACTAAGTTGGCAGCATCACCCGACGCACTTTGCGCCGAAT

AAATACCTGTGACGGAAGATCACTTCGCAGAATAAATAAATCCTGGTGTCCCTGTTGATA 

CCGGGAAGCCCTGGGCCAACTTTGGCGAAAATGAGACGTTGATCGGCACGTAAGAGGTTC 

CAACTTTCACCATAATGAAATAAGATCACTACCGGGCGTATTTTTTGAGTTATCGAGATT 

TTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAAAATCACTGGATATACCACCGTTGATAT 

ATCCCAATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTCAGTTGCTCAATGTAGCTA 

TAACCAGACCGTTCAGCTGGATATTACGGCCTTTTTAAAGACCGTAAAGAAAAATAAGCA 

CAAGTTTTATCCGGCCTTTATTCACATTCTTGCCCGCCTGATGAATGCTCATCCGGAATT 

CCGTATGGCAATGAAAGACGGTGAGCTGGTGATATGGGATAGTGTTCACCCTTGTTACAC 

CGTTTTCCATGAGCAAACTGAAACGTTTTCATCGCTCTGGAGTGAATACCACGACGATTT 

CCGGCAGTTTCTACACATATATTCGCAAGATGTGGCGTGTTACGGTGAAAACCTGGCCTA 

TTTCCCTAAAGGGTTTATTGAGAATATGTTTTTCGTCTCAGCCAATCCCTGGGTGAGTTT 

CACCAGTTTTGATTTAAACGTGGCCAATATGGACAACTTCTTCGCCCCCGTTTTCACCAT 

GGGCAAATATTATACGCAAGGCGACAAGGTGCTGATGCCGCTGGCGATTCAGGTTCATCA 

TGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGCTTAATGAATTACAACAGTACTGCGA 

TGAGTGGCAGGGCGGGGCGTAATCGCGTGGATCCGGCTTACTAAAAGCCAGATAACAGTA

TGCGTATTTGCGCGCTGATTTTTGCGGTATAAGAATATATACTGATATGTATACCCGAAG 

TATGTCAAAAAGAGGTGTGCTATGAAGCAGCGTATTACAGTGACAGTTGACAGCGACAGC 

TATCAGTTGCTCAAGGCATATATGATGTCAATATCTCCGGTCTGGTAAGCACAACCATGC 

AGAATGAAGCCCGTCGTCTGCGTGCCGAACGCTGGAAAGCGGAAAATCAGGAAGGGATGG 

CTGAGGTCGCCCGGTTTATTGAAATGAACGGCTCTTTTGCTGACGAGAACAGGGACTGGT 

GAAATGCAGTTTAAGGTTTACACCTATAAAAGAGAGAGCCGTTATCGTCTGTTTGTGGAT 

GTACAGAGTGATATTATTGACACGCCCGGGCGACGGATGGTGATCCCCCTGGCCAGTGCA 

CGTCTGCTGTCAGATAAAGTCTCCCGTGAACTTTACCCGGTGGTGCATATCGGGGATGAA 

AGCTGGCGCATGATGACCACCGATATGGCCAGTGTGCCGGTCTCCGTTATCGGGGAAGAA 

GTGGCTGATCTCAGCCACCGCGAAAATGACATCAAAAACGCCATTAACCTGATGTTCTGG 

GGAATATAAATGTCAGGCTCCCTTATACACAGCCAGTCTGCAGGTCGATACAGTAGAAAT 

TACAGAAACTTTATCACGTTTAGTAAGTATAGAGGCTGAAAATCCAGATGAAGCCGAACG 

ACTTGTAAGAGAAAAGTATAAGAGTTGTGAAATTGTTCTTGATGCAGATGATTTTCAGGA 

CTATGACACTAGCGTATATGAATAGGTAGATGTTTTTATTTTGTCACACAAAAAAGAGGC 

TCGCACCTCTTTTTCTTATTTCTTTTTATGATTTAATACGGCATTGAGGACAATAGCGAG 

TAGGCTGGATACGACGATTCCGTTTGAGAAGAACATTTGGAAGGCTGTCGGTCGACTAAG 

TTGGCAGCATCACCCGAAGAACATTTGGAAGGCTGTCGGTCGACTACAGGTCACTAATAC 

CATCTAAGTAGTTGATTCATAGTGACTGGATATGTTGTGTTTTACAGTATTATGTAGTCT 

GTTTTTTATGCAAAATCTAATTTAATATATTGATATTTATATCATTTTACGTTTCTCGTT 

CAGCTTTTTTGTACAAAGTTGGCATTATAAAAAAGCATTGCTCATCAATTTGTTGCAACG 

AACAGGTCACTATCAGTCAAAATAAAATCATTATTTGGGGCCCGAGATCCATGCTAGCGT 

TAAC 

GCCTTACGCATCTGTGCGGTATTTCACACCGCAGGCAAGTGCACAAACAATACTTAAATA

AATACTACTCAGTAATAACCTATTTCTTAGCATTTTTGACGAAATTTGCTATTTTGTTAG 

AGTCTTTTACACCATTTGTCTCCACACCTCCGCTTACATCAACACCAATAACGCCATTTA 

ATCTAAGCGCATCACCAACATTTTCTGGCGTCAGTCCACCAGCTAACATAAAATGTAAGC 

TTTCGGGGCTCTCTTGCCTTCCAACCCAGTCAGAAATCGAGTTCCAATCCAAAAGTTCAC 

CTGTCCCACCTGCTTCTGAATCAAACAAGGGAATAAACGAATGAGGTTTCTGTGAAGCTG 

CACTGAGTAGTATGTTGCAGTCTTTTGGAAATACGAGTCTTTTAATAACTGGCAAACCGA

GGAACTCTTGGTATTCTTGCCACGACTCATCTCCATGCAGTTGGACGATATCAATGCCGT 

AATCATTGACCAGAGCCAAAACATCCTCCTTAGGTTGATTACGAAACACGCCAACCAAGT 

ATTTCGGAGTGCCTGAACTATTTTTATATGCTTTTACAAGACTTGAAATTTTCCTTGCAA 

TAACCGGGTCAATTGTTCTCTTTCTATTGGGCACACATATAATACCCAGCAAGTCAGCAT 

CGGAATCTAGAGCACATTCTGCGGCCTCTGTGCTCTGCAAGCCGCAAACTTTCACCAATG 

GACCAGAACTACCTGTGAAATTAATAACAGACATACTCCAAGCTGCCTTTGTGTGCTTAA 

TCACGTATACTCACGTGCTCAATAGTCACCAATGCCCTCCCTCTTGGCCCTCTCCTTTTC 

TTTTTTCGACCGAATTAATTCTTAATCGGCAAAAAAAGAAAAGCTCCGGATCAAGATTGT 

ACGTAAGGTGACAAGCTATTTTTCAATAAAGAATATCTTCCACTACTGCCATCTGGCGTC 

ATAACTGCAAAGTACACATATATTACGATGCTGTCTATTAAATGCTTCCTATATTATATA 

TATAGTAATGTCGTTTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAA 

GCCAGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGG 

CATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCAC 

CGTCATCACCGAAACGCGCGAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTA 

ATGTCATGATAATAATGGTTTCTTAGGACGGATCGCTTGCCTGTAACTTACACGCGCCTC 

GTATCTTTTAATGATGGAATAATTTGGGAATTTACTCTGTGTTTATTTATTTTTATGTTT 

TGTATTTGGATTTTAGAAAGTAAATAAAGAAGGTAGAAGAGTTACGGAATGAAGAAAAAA 

AAATAAACAAAGGTTTAAAAAATTTCAACAAAAAGCGTACTTTACATATATATTTATTAG 

ACAAGAAAAGCAGATTAAATAGATATACATTCGATTAACGATAAGTAAAATGTAAAATCA 

CAGGATTTTCGTGTGTGGTCTTCTACACAGACAAGATGAAACAATTCGGCATTAATACCT 

GAGAGCAGGAAGAGCAAGATAAAAGGTAGTATTTGTTGGCGATCCCCCTAGAGTCTTTTA

CATCTTCGGAAAACAAAAACTATTTTTTCTTTAATTTCTTTTTTTACTTTCTATTTTTAA 

TTTATATATTTATATTAAAAAATTTAAATTATAATTATTTTTATAGCACGTGATGAAAAG 

GACCCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAA 

ATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATAT

TGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCG 

GCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAA 

GATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTT

GAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGT 

GGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTAT 

TCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATG 

ACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTA 

CTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTTCACAACATGGGGGAT 

CATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAG 

CGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAA 

CTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCA 

GGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCC 

GGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGT 

ATCGTAGTTATCTACACGACGGGCAGTCAGGCAACTATGGATGAACGAAATAGACAGATC 

GCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATAT 

ATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTT 

TTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGAC 

CCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGC 

TTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCA 

ACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTA 

GTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCT 

CTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTG 

GACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGC- 
ACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCAT

TGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGG 

GTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGGAACGCCTGGTATCTTTATAGT 

CCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGG 

CCGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGG 

CCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACC 

GCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTG 

AGCGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATT 

CATTAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCA 

ATTAATGTGAGTTACCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCT 

CCTATGTTGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCAT 

GATTACGCCAAGCTCGGAATTAACCCTCACTAAAGGGAACAAAAGCTGGGTACCGGGCCC 

CCCCTCGAGATCCGGGATCGAAGAAATGATGGTAAATGAAATAGGAAATCAAGGAGCATG

AAGGCAAAAGACAAATATAAGGGTCGAACGAAAAATAAAGTGAAAAGTGTTGATATGATG 

TATTTGGCTTTGCGGCGCCGAAAAAACGAGTTTACGCAATTGCACAATCATGCTGACTCT 

GTGGCGGACCCGCGCTCTTGCCGGCCCGGCGATAACGCTGGGCGTGAGGCTGTGCCCGGC 

GGAGTTTTTTGCGCCTGCATTTTCCAAGGTTTACCCTGCGCTAAGGGGCGAGATTGGAGA 

AGCAATAAGAATGCCGGTTGGGGTTGCGATGATGACGACCACGACAACTGGTGTCATTAT 

TTAAGTTGCCGAAAGAACCTGAGTGCATTTGCAACATGAGTATACTAGAAGAATGAGCCA

AGACTTGCGAGACGCGAGTTTGCCGGTGGTGCGAACAATAGAGCGACCATGACCTTGAAG 

GTGAGACGCGCATAACCGCTAGAGTACTTTGAAGAGGAAACAGCAATAGGGTTGCTACCA 

GTATAAATAGACAGGTACATACAACACTGGAAATGGTTGTCTGTTTGAGTACGCTTTCAA 

TTCATTTGGGTGTGCACTTTATTATGTTACAATATGGAAGGGAACTTTACACTTCTCCTA 

TGCACATATATTAATTAAAGTCCAATGCTAGTAGAGAAGGGGGGTAACACCCCTCCGCGC 

TCTTTTCCGATTTTTTTCTAAACCGTGGAATATTTCGGATATCCTTTTGTTGTTTCCGGG 

TGTACAATATGGACTTCCTCTTTTCTGGCAACCAAACCCATACATCGGGATTCCTATAAT 

ACCTTCGTTGGTCTCCCTAACATGTAGGTGGCGGAGGGGAGATATACAATAGAACAGATA 

CCAGACAAGACATAATGGGCTAAACAAGACTACACCAATTACACTGCCTCATTGATGGTG 

GTACATAACGAACTAATACTGTAGCCCTAGACTTGATAGCCATCATCATATCGAAGTTTC

ACTACCCTTTTTCCATTTGCCATCTATTGAAGTAATAATAGGCGCATGCAACTTCTTTTC 

TTTTTTTTTCTTTTCTCTCTCCCCCGTTGTTGTCTCACCATATCCGCAATGACAAAAAAA 

ATGATGGAAGACACTAAAGGAAAAAATTAACGACAAAGACAGCACCAACAGATGTCGTTG 

TTCCAGAGCTGATGAGGGGTATCTTCGAACACACGAAACTTTTTCCTTCCTTCATTCACG 

CACACTACTCTCTAATGAGCAACGGTATACGGCCTTCCTTCCAGTTACTTGAATTTGAAA 

TAAAAAAAGTTTGCCGCTTTGCTATCAAGTATAAATAGACCTGCAATTATTAATCTTTTG 

TTTCCTCGTCATTGTTCTCGTTCCCTTTCTTCCTTGTTTCTTTTTCTGCACAATATTTCA 

AGCTATACCAAGCATACAATCAACTCCAAGCTTATGCCCAAGAAGAAGCGGAAGGTCTCG

AGCGGCGCCAATTTTAATCAAAGTGGGAATATTGCTGATAGCTCATTGTCCTTCACTTTC

ACTAACAGTAGCAACGGTCCGAACCTCATAACAACTCAAACAAATTCTCAAGCGCTTTCA 

CAACCAATTGCCTCCTCTAACGTTCATGATAACTTCATGAATAATGAAATCACGGCTAGT 

AAAATTGATGATGGTAATAATTCAAAACCACTGTCACCTGGTTGGACGGACCAAACTGCG 

TATAACGCGTTTGGAATCACTACAGGGATGTTTAATACCACTACAATGGATGATGTATAT 

AACTATCTATTCGATGATGAAGATACCCCACCAAACCCAAAAAAAGAGGGTGGGTCGATC

ACAAGTTTGTACAAAAAAGCAGGCTTGTCGACCCCGGGAATTCAGATCTACTAGTGCGGC 

CGCACGCGTACCCAGCTTTCTTGTACAAAGTGGTGACGTCGAGCTCCCTATAGTGAGTCG 

TATTACACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACACCGGTGAGCTCTAAGT 

AAGTAACGGCCGCCACCGCGGTGGAGCTTTGGACTTCTTCGCCAGAGGTTTGGTCAAGTC 

TCCAATCAAGGTTGTCGGCTTGTCTACCTTGCCAGAAATTTACGAAAAGATGGAAAAGGG 

TCAAATCGTTGGTAGATACGTTGTTGACACTTCTAAATAAGCGAATTTCTTATGATTTAT 

GATTTTTATTATTAAATAAGTTATAAAAAAAATAAGTGTATACAAATTTTAAAGTGACTC 

TTAGGTTTTAAAACGAAAATTCTTGTTCTTGAGTAACTCTTTCCTGTAGGTCAGGTTGCT 

TTCTCAGGTATAGCATGAGGTCGCTCTTATTGACCACACCTCTACCGGCATGCCGAGCAA

ATGCCTGCAAATCGCTCCCCATTTCACCCAATTGTAGATATGCTAACTCCAGCAATGAGT 

TGATGAATCTCGGTGTGTATTTTATGTCCTCAGAGGACAATACCTGTTGTAATCGTTCTT 

CCACACGGATCCGCATCAGGCGAAATTGTAAACGTTAATATTTTGTTAAAATTCGCGTTA 

AATATTTGTTAAATCAGCTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTAT 

AAATCAAAAGAATAGACCGAGATAGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCA 

CTATTAAAGAACGTGGACTCCAACGTCAAAGGGCGAAAAACCGTCTATCAGGGCGATGGC- 
CCACTACGTGAACCATCACCCTAATCAAGTTTTTTGGGGTCGAGGTGCCGTAAAGCACTA
AATCGGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGACGGGGAAAGCCGGCGAACGTG 
GCGAGAAAGGAAGGGAAGAAAGCGAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCG 
GTCACGCTGGGCGTAACCACCACACCCGCCGCGCTTAATGCGCCGCTACAGGGCGCGTCC 
CATTCGCCATTCACTGCA 

GACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATGGTTT

CTTAGGACGGATGGCTTGCCTGTAACTTACACGCGCCTCGTATCTTTTAATGATGGAATA 

ATTTGGGAATTTACTCTGTGTTTATTTATTTTTATGTTTTGTATTTGGATTTTAGAAAGT 

AAATAAAGAAGGTAGAAGAGTTACGGAATGAAGAAAAAAAAATAAACAAAGGTTTAAAAA 

ATTTCAACAAAAAGCGTACTTTACATATATATTTATTAGACAAGAAAAGCAGATTAAATA

GATATACATTCGATTAACGATAAGTAAAATGTAAAATCACAGGATTTTCGTGTGTGGTCT 

TCTACACAGACAAGATGAAACAATTCGGCATTAATACCTGAGAGCAGGAAGAGCAAGATA

AAAGGTAGTATTTGTTGGCGATCCCCCTAGAGTCTTTTACATCTTCGGAAAACAAAAACT 

ATTTTTTCTTTAATTTCTTTTTTTACTTTCTATTTTTAATTTATATATTTATATTAAAAA 

ATTTAAATTATAATTATTTTTATAGCACGTGATGAAAAGGACCCAGGTGGCACTTTTCGG 

GGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCG

CTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGT 

ATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTT 

GCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTG 

GGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAA 

CGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATT 

GACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAG 

TACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGT

GCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGA 

CCGAAGGAGCTAACCGCTTTTTTTCACAACATGGGGGATCATGTAACTCGCCTTGATCGT 

TGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTA 

GCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGG 

CAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCC 

CTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGT 

ATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACG 

GGCAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTG

ATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAA 

CTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAA 

ATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGA

TCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCG 

CTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACT 

GGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCAC

CACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTG 

GCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCG 

GATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGA 

ACGACCTACACCGAACTGAGATACCTACAGCGTGAGCATTGAGAAAGCGCCACGCTTCCC 

GAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACG 

AGGGAGCTTCCAGGGGGGAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTC 

TGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCCGAGCCTATGGAAAAACGCC 

AGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTT 

CCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACC 

GCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGC

CCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGAC 

AGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTACCTCACT 

CATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCCTATGTTGTGTGGAATTGTG 

AGCGGATAACAATTTCACACAGGAAACAGCTATGACCATGATTACGCCAAGCTCGGAATT 

AACCCTCACTAAAGGGAACAAAAGCTGGGTACCGGGCCCCCCCTCGAGATCCGGGATCGA 

AGAAATGATGGTAAATGAAATAGGAAATCAAGGAGCATGAAGGCAAAAGACAAATATAAG 

GGTCGAACGAAAAATAAAGTGAAAAGTGTTGATATGATGTATTTGGCTTTGCGGCGCCGA 

AAAAACGAGTTTACGCAATTGCACAATCATGCTGACTCTGTGGCGGACCCGCGCTCTTGC 

CGGCCCGGCGATAACGCTGGGCGTGAGGCTGTGCCCGGCGGAGTTTTTTGCGCCTGCATT 

TTCCAAGGTTTACCCTGCGCTAAGGGGCGAGATTGGAGAAGCAATAAGAATGCCGGTTGG 

GGTTGCGATGATGACGACCACGACAACTGGTGTCATTATTTAAGTTGCCGAAAGAACCTG 

AGTGCATTTGCAACATGAGTATACTAGAAGAATGAGCCAAGACTTGCGAGACGCGAGTTT

GCCGGTGGTGCGAACAATAGAGCGACCATGACCTTGAAGGTGAGACGCGCATAACCGCTA- 
GAGTACTTTGAAGAGGAAACAGCAATAGGGTTGCTACCAGTATAAATAGACAGGTACATA

CAACACTGGAAATGGTTGTCTGTTTGAGTACGCTTTCAATTCATTTGGGTGTGCACTTTA 

TTATGTTACAATATGGAAGGGAACTTTACACTTCTCCTATGCACATATATTAATTAAAGT 

CCAATGCTAGTAGAGAAGGGGGGTAACACCCCTCCGCGCTCTTTTCCGATTTTTTTCTAA 

ACCGTGGAATATTTCGGATATCCTTTTGTTGTTTCCGGGTGTACAATATGGACTTCCTCT 

TTTCTGGCAACCAAACCCATACATCGGGATTCCTATAATACCTTCGTTGGTCTCCCTAAC 

ATGTAGGTGGCGGAGGGGAGATATACAATAGAACAGATACCAGACAAGACATAATGGGCT 

AAACAAGACTACACCAATTACACTGCCTCATTGATGGTGGTACATAACGAACTAATACTG 

TAGCCCTAGACTTGATAGCCATCATCATATCGAAGTTTCACTACCCTTTTTCCATTTGCC 

ATCTATTGAAGTAATAATAGGCGCATGCAACTTCTTTTCTTTTTTTTTCTTTTCTCTCTC 

CCCCGTTGTTGTCTCACCATATCCGCAATGACAAAAAAAATGATGGAAGACACTAAAGGA 

AAAAATTAACGACAAAGACAGCACCAACAGATGTCGTTGTTCCAGAGCTGATGAGGGGTA

TCTTCGAACACACGAAACTTTTTCCTTCCTTCATTCACGCACACTACTCTCTAATGAGCA 

ACGGTATACGGCCTTCCTTCCAGTTACTTGAATTTGAAATAAAAAAAGTTTGCCGCTTTG 

CTATCAAGTATAAATAGACCTGCAATTATTAATCTTTTGTTTCCTCGTCATTGTTCTCGT 

TCCCTTTCTTCCTTGTTTCTTTTTCTGCACAATATTTCAAGCTATACCAAGCATACAATC 

AACTCCAAGCTTATGCCCAAGAAGAAGCGGAAGGTCTCGAGCGGCGCCAATTTTAATCAA

AGTGGGAATATTGCTGATAGCTCATTGTCCTTCACTTTCACTAACAGTAGCAACGGTCCG 

AACCTCATAACAACTCAAACAAATTCTCAAGCGCTTTCACAACCAATTGCCTCCTCTAAC 

GTTCATGATAACTTCATGAATAATGAAATCACGGCTAGTAAAATTGATGATGGTAATAAT 

TCAAAACCACTGTCACCTGGTTGGACGGACCAAACTGCGTATAACGCGTTTGGAATCACT

ACAGGGATGTTTAATACCACTACAATGGATGATGTATATAACTATCTATTCGATGATGAA 

GATACCCCACCAAACCCAAAAAAAGAGGGTGGGTCGATCACAAGTTTGTACAAAAAAGCA

GGCTTGTCGACCCCGGGAATTCAGATCTACTAGTGCGGCCGCACGCGTACCCAGCTTTCT 

TGTACAAAGTGGTGACGTCGAGCTCTAAGTAAGTAACGGCCGCCACCGCGGTGGAGCTTT 

GGACTTCTTCGCCAGAGGTTTGGTCAAGTCTCCAATCAAGGTTGTCGGCTTGTCTACCTT 

GCCAGAAATTTACGAAAAGATGGAAAAGGGTCAAATCGTTGGTAGATACGTTGTTGACAC 

TTCTAAATAAGCGAATTTCTTATGATTTATGATTTTTATTATTAAATAAGTTATAAAAAA 

AATAAGTGTATACAAATTTTAAAGTGACTCTTAGGTTTTAAAACGAAAATTCTTGTTCTT 

GAGTAACTCTTTCCTGTAGGTCAGGTTGCTTTCTCAGGTATAGCATGAGGTCGCTCTTAT 

TGACCACACCTCTACCGGCATGCCGAGCAAATGCCTGCAAATCGCTCCCCATTTCACCCA 

ATTGTAGATATGCTAACTCCAGCAATGAGTTGATGAATCTCGGTGTGTATTTTATGTCCT 

CAGAGGACAATACCTGTTGTAATCGTTCTTCCACACGGATCCCAATTCGCCCTATAGTGA 

GTCGTATTACAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGT 

TACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGA 

GGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGACGCGCCC 

TGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTT 

GCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCC 

GGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTA 

CGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCC 

TGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTG 

TTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATT 

TTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAAT 

TTTAACAAAATATTAACGTTTACAATTTCCTGATGCGGTATTTTCTCCTTACGCATCTGT 

GCGGTATTTCACACCGCAGGCAAGTGCACAAACAATACTTAAATAAATACTACTCAGTAA 

TAACCTATTTCTTAGCATTTTTGACGAAATTTGCTATTTTGTTAGAGTCTTTTACACCAT 

TTGTCTCCACACCTCCGCTTACATCAACACCAATAACGCCATTTAATCTAAGCGCATCAC 

CAACATTTTCTGGCGTCAGTCCACCAGCTAACATAAAATGTAAGCTTTCGGGGCTCTCTT 

GCCTTCCAACCCAGTCAGAAATCGAGTTCCAATCCAAAAGTTCACCTGTCCCACCTGCTT 

CTGAATCAAACAAGGGAATAAACGAATGAGGTTTCTGTGAAGCTGCACTGAGTAGTATGT 

TGCAGTCTTTTGGAAATACGAGTCTTTTAATAACTGGCAAACCGAGGAACTCTTGGTATT

CTTGCCACGACTCATCTCCATGCAGTTGGACGATATCAATGCCGTAATCATTGACCAGAG 

CCAAAACATCCTCCTTAGGTTGATTACGAAACACGCCAACCAAGTATTTCGGAGTGCCTG

AACTATTTTTATATGCTTTTACAAGACTTGAAATTTTCCTTGCAATAACCGGGTCAATTG 

TTCTCTTTCTATTGGGCACACATATAATACCCAGCAAGTCAGCATCGGAATCTAGAGCAC 

ATTCTGCGGCCTCTGTGCTCTGCAAGCCGCAAACTTTCACCAATGGACCAGAACTACCTG 

TGAAATTAATAACAGACATACTCCAAGCTGCCTTTGTGTGCTTAATCACGTATACTCACG 

TGCTCAATAGTCACCAATGCCCTCCCTCTTGGCCCTCTCCTTTTCTTTTTTCGACCGAAT- 






TAATTCTTAATCGGCAAAAAAAGAAAAGCTCCGGATCAAGATTGTACGTAAGGTGACAAG 
CTATTTTTCAATAAAGAATATCTTCCACTACTGCCATCTGGCGTCATAACTGCAAAGTAC 
ACATATATTACGATGCTGTCTATTAAATGCTTCCTATATTATATATATAGTAATGTCGTT 
TATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACC 
CGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGAC 
AAGCTGTGACCGTCTGCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAAC 

GCGCGA 
INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description on page 54, line



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet 



Name of depositary institution 

Agricultural Research Culture Collection (NRRL) 
International Depository Authority 



Address of depositary institution (including postal code and country) 

1815 N. University Street 
Peoria, Illinois 61604 
United States of America 



Date of deposit February 27, 1999 



Accession Number NRRL B-30103



C. ADDITIONAL INDICATIONS (leave blank if not applicable) 



This information is continued on an additional sheet □



Escherichia coli DB3.1(pEZC15101) 

In respect of those designations in which a European Patent is sought a sample of the deposited microorganism will be made 
available until the publication of the mention of the grant of the European patent or until the date on which the application has been 
refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person 
requesting the sample (Rule 28(4) EPC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit") 



For receiving Office use only 



This sheet was received with the international application



Authorized officer 









For International Bureau use only 



□ This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992)



mo-30103.ep 
INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description on page 55, line 16


B. IDENTIFICATION OF DEPOSIT 


Further deposits are identified on an additional sheet ^ 


Name of depositary institution 




Agricultural Research Culture Collection (NRRL) 
International Depository Authority 




Address of depositary institution (including postal code and country) 


1815 N. University Street 
Peoria, Illinois 61604 
United States of America 




Date of deposit February 27, 1999 


Accession Number NRRL B-30100 



C. ADDITIONAL INDICATIONS (leave blank if not applicable)

This information is continued on an additional sheet □



Escherichia coli DB3.1(pENTR-1A)

In respect of those designations in which a European Patent is sought a sample of the deposited microorganism will be made 
available until the publication of the mention of the grant of the European patent or until the date on which the application has been 
refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person 
requesting the sample (Rule 28(4) EPC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit") 



For receiving Office use only 



This sheet was received with the international application 



Authorized officer 





For International Bureau use only 



□ This sheet was received by the International Bureau on:



Authorized officer 



Form PCT/RO/134 (July 1992)



mo-30100.ep 






INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description on page 55, line 16


B. IDENTIFICATION OF DEPOSIT 


Further deposits are identified on an additional sheet ^ 


Name of depositary institution 




Agricultural Research Culture Collection (NRRL) 
International Depository Authority 




Address of depositary institution (including postal code and country) 


1815 N. University Street 
Peoria, Illinois 61604 
United States of America 




Date of deposit February 27, 1999 


Accession Number NRRL B-30102 



C. ADDITIONAL INDICATIONS (leave blank if not applicable)

This information is continued on an additional sheet □



Escherichia coli DB3.1(pENTR-3C) 

In respect of those designations in which a European Patent is sought a sample of the deposited microorganism will be made 
available until the publication of the mention of the grant of the European patent or until the date on which the application has been 
refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person 
requesting the sample (Rule 28(4) EPC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit") 



For receiving Office use only 






This sheet was received with the international application 



Authorized officer 



For International Bureau use only



□ This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 



mo-30102.ep 






INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description on page 55, line 16



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet 



Name of depositary institution 

Agricultural Research Culture Collection (NRRL) 
International Depository Authority 



Address of depositary institution (including postal code and country) 

1815 N. University Street 
Peoria, Illinois 61604 
United States of America 



Date of deposit February 27, 1999 



Accession Number NRRL B-30101



C. ADDITIONAL INDICATIONS (leave blank if not applicable) 



This information is continued on an additional sheet □ 



Escherichia coli DB3.1(pENTR-2B)

In respect of those designations in which a European Patent is sought a sample of the deposited microorganism will be made 
available until the publication of the mention of the grant of the European patent or until the date on which the application has been 
refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person 
requesting the sample (Rule 28(4) EPC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit") 



For receiving Office use only 



This sheet was received with the international application



Authorized officer 




For International Bureau use only 



□ This sheet was received by the International Bureau on:



Authorized officer 



Form PCT/RO/134 (July 1992) 



mo-30101.ep 



PCT/US00/05432 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description on page
20-21 . 




B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet S 



Name of depositary institution 



Agricultural Research Culture Collection (NRRL) 
International Depository Authority 

Address of depositary institution (including postal code and country) 



1815 N. University Street 
Peoria, Illinois 61604 
United States of America 



Date of deposit February 27, 1999 


Accession Number NRRL B-30108


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



Escherichia coli DB10B(pCMVSport6) 



In respect of those designations in which a European Patent is sought a sample of the deposited microorganism will be made 
available until the publication of the mention of the grant of the European patent or until the date on which the application has been 
refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person 
requesting the sample (Rule 28(4) EPC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit") 



For receiving Office use only 



This sheet was received with the international application



Authorized officer 




For International Bureau use only 



□ This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992)



mo-30108.ep 



PCT/US00/05432 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description on page 54, line 9



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet 



Name of depositary institution 

Agricultural Research Culture Collection (NRRL) 
International Depository Authority 



Address of depositary institution (including postal code and country) 

1815 N. University Street 
Peoria, Illinois 61604 
United States of America 



Date of deposit February 27, 1999 


Accession Number NRRL B-30105


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



Escherichia coli DB3.1(pEZC15103) 

In respect of those designations in which a European Patent is sought a sample of the deposited microorganism will be made 
available until the publication of the mention of the grant of the European patent or until the date on which the application has been 
refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person 
requesting the sample (Rule 28(4) EPC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit")



For receiving Office use only 






This sheet was received with the international application 



Authorized officer 







For International Bureau use only 



□ This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 



mo-30105.ep



PCT/US00/05432 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis) 



A. The indications made below relate to the microorganism referred to in the description on page 54, line 9.



B. IDENTIFICATION OF DEPOSIT 



Name of depositary institution 



Further deposits are identified on an additional sheet 



Agricultural Research Culture Collection (NRRL) 
International Depository Authority 

Address of depositary institution (including postal code and country) 



1815 N. University Street 
Peoria, Illinois 61604
United States of America 



Date of deposit February 27, 1999 



Accession Number 



NRRL B-30104



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

Escherichia coli DB3.1(pEZC15102) 



In respect of those designations in which a European Patent is sought a sample of the deposited microorganism will be made 
available until the publication of the mention of the grant of the European patent or until the date on which the application has been 
refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person 
requesting the sample (Rule 28(4) EPC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit")



For receiving Office use only 



This sheet was received with the international application



Authorized officer 




For International Bureau use only



□ This sheet was received by the International Bureau on:



Authorized officer 



Form PCT/RO/134 (July 1992) 



mo-30104.ep 



PCT/US00/05432 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description on page 52, line


B. IDENTIFICATION OF DEPOSIT 


Further deposits are identified on an additional sheet EI 


Name of depositary institution 




Agricultural Research Culture Collection (NRRL) 
International Depository Authority 


Address of depositary institution (including postal code and country)


1815 N. University Street 
Peoria, Illinois 61604 
United States of America 




Date of deposit February 27, 1999


Accession Number NRRL B-30099 



C. ADDITIONAL INDICATIONS (leave blank if not applicable)



This information is continued on an additional sheet □



Escherichia coli DB3.1(pAHPKan) or Escherichia coli DB3.1(pAttPKan)

In respect of those designations in which a European Patent is sought a sample of the deposited microorganism will be made 
available until the publication of the mention of the grant of the European patent or until the date on which the application has been 
refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person 
requesting the sample (Rule 28(4) EPC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit")







This sheet was received with the international application


□ This sheet was received by the International Bureau on: 


Authorized officer Barbara Fridie


Authorized officer 



Form PCT/RO/134 (July 1992) 



mo-30099.ep 



WO 00/52027 



PCT/US00/05432 



Escherichia coli DB3.1(pENTR-3C) 



ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in Rule 31F(1) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the public 
under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the furnishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has been 
filed by the applicant, any request made by a third party for the furnishing of a sample shall 
indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Norwegian Patent office or any person approved by the applicant in the 
individual case. 

SINGAPORE 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 



WO 00/52027 



PCT/US00/05432 



Escherichia coli DB3.1(pENTR-3C) 
SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
International Bureau before the expiration of 16 months from the priority date (preferably on the
Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such
a request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list of 
recognized experts drawn up by the Swedish Patent office or any person approved by the 
applicant in the individual case. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 



WO 00/52027 



PCT/US00/05432 



Escherichia coli DB3.1(pENTR-2B)
AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

CANADA 

The applicant hereby requests that, until either a Canadian patent has been issued on the basis 
of the application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the furnishing of a sample of deposited biological material 
referred to in the application only be effected to an independent expert nominated by the 
Commissioner of Patents. 

DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed 
by the applicant, any request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on a list of recognized experts 
drawn up by the Danish Patent office or any person approved by the applicant in the individual 
case. 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art. The request to this effect 
shall be filed by the applicant with the International Bureau before the expiration of 16 months
from the priority date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume 
I of the PCT Applicant's Guide). If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the National Board 
of Patents and Registration or any person approved by the applicant in the individual case. 



WO 00/52027 



PCT/US00/05432 



Escherichia coli DB3.1(pENTR-2B) 



ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in Rule 31F(1) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the public 
under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the furnishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has been 
filed by the applicant, any request made by a third party for the furnishing of a sample shall 
indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Norwegian Patent office or any person approved by the applicant in the 
individual case. 

SINGAPORE 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 



WO 00/52027 



PCT/US00/05432 



Escherichia coli DB3.1(pENTR-2B) 



SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the
International Bureau before the expiration of 16 months from the priority date (preferably on the
Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such
a request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list of 
recognized experts drawn up by the Swedish Patent office or any person approved by the 
applicant in the individual case. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 



WO 00/52027 



PCT/US00/05432 



Escherichia coli DB3.1(pENTR-1A)
AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

CANADA 

The applicant hereby requests that, until either a Canadian patent has been issued on the basis 
of the application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the furnishing of a sample of deposited biological material 
referred to in the application only be effected to an independent expert nominated by the 
Commissioner of Patents. 

DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed 
by the applicant, any request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on a list of recognized experts 
drawn up by the Danish Patent office or any person approved by the applicant in the individual 
case. 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art. The request to this effect 
shall be filed by the applicant with the International Bureau before the expiration of 16 months
from the priority date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume 
I of the PCT Applicant's Guide). If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the National Board 
of Patents and Registration or any person approved by the applicant in the individual case. 



WO 00/52027 



PCT/US00/05432 



Escherichia coli DB3.1(pENTR-1A)



ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be
effected to an expert in the art. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in Rule 31F(1) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the public 
under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the furnishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has been 
filed by the applicant, any request made by a third party for the furnishing of a sample shall 
indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Norwegian Patent office or any person approved by the applicant in the 
individual case. 

SINGAPORE 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 



WO 00/52027 



PCT/US00/05432 



Escherichia coli DB3.1(pENTR-1A)
SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
International Bureau before the expiration of 16 months from the priority date (preferably on the
Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such 
a request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list of 
recognized experts drawn up by the Swedish Patent office or any person approved by the 
applicant in the individual case. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 



WO 00/52027 



PCT/US00/05432 



Escherichia coli DB10B(pCMVSport6) 
AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

CANADA 

The applicant hereby requests that, until either a Canadian patent has been issued on the basis 
of the application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the furnishing of a sample of deposited biological material 
referred to in the application only be effected to an independent expert nominated by the 
Commissioner of Patents. 

DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed
by the applicant, any request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on a list of recognized experts 
drawn up by the Danish Patent office or any person approved by the applicant in the individual 
case. 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art. The request to this effect 
shall be filed by the applicant with the International Bureau before the expiration of 16 months
from the priority date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume 
I of the PCT Applicant's Guide). If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the National Board
of Patents and Registration or any person approved by the applicant in the individual case. 



WO 00/52027 



PCT/US00/05432 



Escherichia coli DB3.1(pAHPKan) or Escherichia coli DB3.1(pAttPKan)
AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

CANADA 

The applicant hereby requests that, until either a Canadian patent has been issued on the basis 
of the application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the furnishing of a sample of deposited biological material 
referred to in the application only be effected to an independent expert nominated by the 
Commissioner of Patents. 

DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed 
by the applicant, any request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on a list of recognized experts 
drawn up by the Danish Patent office or any person approved by the applicant in the individual 
case. 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art. The request to this effect 
shall be filed by the applicant with the International Bureau before the expiration of 16 months 
from the priority date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume 
I of the PCT Applicant's Guide). If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the National Board 
of Patents and Registration or any person approved by the applicant in the individual case. 



WO 00/52027 PCT/US00/05432 
Escherichia coli DB3.1(pAHPKan) or Escherichia coli DB3.1(pAttPKan)



ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in Rule 31F(1) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the public 
under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the furnishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has been 
filed by the applicant, any request made by a third party for the furnishing of a sample shall 
indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Norwegian Patent Office or any person approved by the applicant in the 
individual case. 

SINGAPORE 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 



Escherichia coli DB3.1(pAHPKan) or Escherichia coli DB3.1(pAttPKafy
SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
International Bureau before the expiration of 16 months from the priority date (preferably on the 
Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such 
a request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list of 
recognized experts drawn up by the Swedish Patent Office or any person approved by the 
applicant in the individual case. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 



Escherichia coli DB10B(pCMVSport6) 



ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in Rule 31F(1) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the public 
under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the furnishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has been 
filed by the applicant, any request made by a third party for the furnishing of a sample shall 
indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Norwegian Patent Office or any person approved by the applicant in the 
individual case. 

SINGAPORE 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 



Escherichia coli DB10B(pCMVSport6) 



SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
International Bureau before the expiration of 16 months from the priority date (preferably on the 
Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such 
a request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list of 
recognized experts drawn up by the Swedish Patent Office or any person approved by the 
applicant in the individual case. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 



Escherichia coli DB3.1(pEZC15103) 
AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

CANADA 

The applicant hereby requests that, until either a Canadian patent has been issued on the basis 
of the application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the furnishing of a sample of deposited biological material 
referred to in the application only be effected to an independent expert nominated by the 
Commissioner of Patents. 

DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed 
by the applicant, any request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on a list of recognized experts 
drawn up by the Danish Patent Office or any person approved by the applicant in the individual 
case. 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art. The request to this effect 
shall be filed by the applicant with the International Bureau before the expiration of 16 months 
from the priority date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume 
I of the PCT Applicant's Guide). If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the National Board 
of Patents and Registration or any person approved by the applicant in the individual case. 



Escherichia coli DB3.1(pEZC15103) 



ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in Rule 31F(1) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the public 
under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the furnishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has been 
filed by the applicant, any request made by a third party for the furnishing of a sample shall 
indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Norwegian Patent Office or any person approved by the applicant in the 
individual case. 

SINGAPORE 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 



Escherichia coli DB3.1(pEZC15103) 
SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
International Bureau before the expiration of 16 months from the priority date (preferably on the 
Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such 
a request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list of 
recognized experts drawn up by the Swedish Patent Office or any person approved by the 
applicant in the individual case. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 

Escherichia coli DB3.1(pEZC15102) 
AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

CANADA 

The applicant hereby requests that, until either a Canadian patent has been issued on the basis 
of the application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the furnishing of a sample of deposited biological material 
referred to in the application only be effected to an independent expert nominated by the 
Commissioner of Patents. 

DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed 
by the applicant, any request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on a list of recognized experts 
drawn up by the Danish Patent Office or any person approved by the applicant in the individual 
case. 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art. The request to this effect 
shall be filed by the applicant with the International Bureau before the expiration of 16 months 
from the priority date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume 
I of the PCT Applicant's Guide). If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the National Board 
of Patents and Registration or any person approved by the applicant in the individual case. 



Escherichia coli DB3.1(pEZC15102) 



ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in Rule 31F(1) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the public 
under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the furnishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has been 
filed by the applicant, any request made by a third party for the furnishing of a sample shall 
indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Norwegian Patent Office or any person approved by the applicant in the 
individual case. 

SINGAPORE 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 



Escherichia coli DB3.1(pEZC15102) 



SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
International Bureau before the expiration of 16 months from the priority date (preferably on the 
Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such 
a request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list of 
recognized experts drawn up by the Swedish Patent Office or any person approved by the 
applicant in the individual case. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 

Escherichia coli DB3.1(pEZC15101) 



AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

CANADA 

The applicant hereby requests that, until either a Canadian patent has been issued on the basis 
of the application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the furnishing of a sample of deposited biological material 
referred to in the application only be effected to an independent expert nominated by the 
Commissioner of Patents. 

DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed 
by the applicant, any request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on a list of recognized experts 
drawn up by the Danish Patent Office or any person approved by the applicant in the individual 
case. 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art. The request to this effect 
shall be filed by the applicant with the International Bureau before the expiration of 16 months 
from the priority date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume 
I of the PCT Applicant's Guide). If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the National Board 
of Patents and Registration or any person approved by the applicant in the individual case. 



Escherichia coli DB3.1(pEZC15101) 



ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in Rule 31F(1) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the public 
under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the furnishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has been 
filed by the applicant, any request made by a third party for the furnishing of a sample shall 
indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Norwegian Patent Office or any person approved by the applicant in the 
individual case. 

SINGAPORE 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 



Escherichia coli DB3.1(pEZC15101) 
SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
International Bureau before the expiration of 16 months from the priority date (preferably on the 
Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such 
a request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list of 
recognized experts drawn up by the Swedish Patent Office or any person approved by the 
applicant in the individual case. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 



Escherichia coli DB3.1(pENTR-3C) 



AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

CANADA 

The applicant hereby requests that, until either a Canadian patent has been issued on the basis 
of the application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the furnishing of a sample of deposited biological material 
referred to in the application only be effected to an independent expert nominated by the 
Commissioner of Patents. 

DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed 
by the applicant, any request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on a list of recognized experts 
drawn up by the Danish Patent Office or any person approved by the applicant in the individual 
case. 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art. The request to this effect 
shall be filed by the applicant with the International Bureau before the expiration of 16 months 
from the priority date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume 
I of the PCT Applicant's Guide). If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the National Board 
of Patents and Registration or any person approved by the applicant in the individual case. 